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MULTIMERIC FORMS OF TNF SUPERFAMILY LIGANDS 



BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to a method form preparing soluble multimeric proteins 
consisting of more than three iterations of the same bioactive molecule using recombinant 
DNA technology. 

The present invention particularly concerns a new method of producing 
multimeric fusion proteins involving the TNF superfamily (TNFSF) members as a 
fusion proteins with SPD, and more specifically, CD40L-SPD fusion proteins and 
useful modifications thereof. 

Description of Related Art 

Numerous proteins can be made using modem molecular biology techniques and 
used in diagnostic and therapeutic apphcations. Using recombinant DNA techniques, the 
DNA encoding a single amino acid chain is constructed and then introduced into a cell 
which manufactures the final protein. Some cells, especially bacteria like E, coli, lack the 
ability to properly fold the amino acid chains into the proper quatemary structure and they 
often fail to apply the necessary modifications (e.g., glycosylation and disulfide bond 
formation) that are needed for the protein to be bioactive and resistant to degradation in vivo. 



While most of these challenges can be met by expressing the amino acid chain in eukaryotic 
cells like yeast or mammalian cells in vitro, it is not always straightforward to express 
proteins that consist of two or more amino acid chains. In general, for multichain proteins, 
the single amino acid chains must associate together in some way either within the producer 
5 cell or subsequently after the monomers are secreted from the producer cell. For artificially 
constructed molecules, the introduction into a single amino acid chain of an amino acid 
sequence which causes this chain-to-chain association can be an important step in producing 
multichain proteins. 

10 One of the most widely used methods of causing two amino acid chains to associate 

is to conjoin, at the DNA coding level, segments from the protein of interest and a segment 
from a spontaneously dimerizing protein. The best example is to conjoin or ftise a protein 
with the Fc portion of immunoglobulin, creating a dimeric Fc fusion protein (Fanslow et al, 
J. Immunol. 136:4099, 1986). A protein of this type can be formed from the extracellular 

15 domain of a tumor necrosis factor receptor fused to Fc (termed etanercept and marketed as 
ENBREL®), which is effective in the treatment of rheumatoid arthritis. A second example 
is the construction of a fusion protein between the dimerizing extracellular portion of CDS 
with the extracellular portion of CD40L(Hollenbaughe/ a/., ^ikffiO J. 11:4313, 1992). 
Here, the dimerizing CDS portion of the fiision protein helps to maintain the CD40L portion 

20 in the trimeric form needed for its bioactivity. A more recent example is the addition of an 
isoleucine zipper motif to CD40L, which permits the production of trimeric soluble CD40L 
molecules (Morris et aL, 1 Biol Chem. 274:418, 1999). 

The TNF superfamily (TNFSF) consists of an expanding number of proteins 
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(see Table I) which are crucial for the development and functioning of the immune, 
hematological, and skeletal systems. TNFSF proteins are ligands for a 
corresponding set of receptors of the TNF receptor superfamily (TNFRSF). All 
TNFSF members are expressed as Type II membrane proteins, with the exception of 
5 lymphotoxin-alpha which is produced as a secreted protein. However, soluble forms 
of several TNFSF proteins can be released from the cell surface by proteolytic 
cleavage, usually by specific metalloproteinases. 

The production of soluble forms of TNFSF proteins has been an important 
10 step in the study of these proteins. Soluble TNFSF Hgands can be used to study the 
activities of these proteins in vitro without the complexities in interpretation that 
result when cells or cellular membranes expressing TNFSF proteins are added. In 
addition, soluble forms of several TNFSF proteins have potential as therapeutic 
agents for human diseases. In particular, TNF-? has been extensively studied for the 
15 treatment of cancer and soluble CD40L is currently undergoing clinical trials to 
assess its antitumor effects. 

To produce soluble forms of TNFSF proteins, either the membrane protein is 
expressed in a cell line possessing a protease capable of separating the TNFSF 
20 extracellular domain from the transmembrane domain or a truncated form of the 
TNFSF protein is produced which consists solely of the extracellular domain plus a 
signal sequence. In either case, certain soluble forms of TNFSF proteins are unstable 
in solution as simple homotrimers composed solely of the extracellular domain. For 
example, naturally solubilized TNF-? is labile under physiological conditions 
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[Schuchmann, 1995 #129]. To solve this stability problem, chimeric proteins have 
been constructed according to one of four different design principles: (1) The 
extracellular portion of the TNFSF protein has been expressed fused to the dimeric 
portion of the inmiunoglobulin Fc fragment US Patent 5,155,027, Oct. 13, 1992^ 

5 issued to, Andrzej Z. Sledziewski, et al. In the case of CD40L and OX40L, this 
yields a soluble molecule v^hich is significantly less active than the native membrane 
form of this protein. (2) The extracellular portion of the TNFSF protein has been 
expressed with an antigenic tag (usually the FLAG motif) fused to its N-terminus 
[Mariani, 1996 ]. The addition of an antibody to the tag (e.g., anti-FLAG antibody) 

10 aggregates these proteins into a multimeric form. Crosslinking enhances activity on 
B cells. (3) The extracellular portion of the TNFSF protein has been expressed fused 
to the spontaneously dimerizing extracellular portion of the CDS molecule 
[HoUenbaugh, 1992 ]. In the case of CD40L, this creates a hexameric molecule 
[PuUen, 1999 ] which is likely formed by two CD40L trimers attached to three CDS 

15 dimeric stalks. Despite this, the addition of an anti-CDS antibody to crosslink the 
CD40L-CD8 fusion protein yields a further enhancement of CD40L activity on B 
cells. (4) The extracellular portion of the TNFSF protein has been expressed fused to 
a trimerizing isoleucine zipper which maintains the overall trimeric structure of the 
protein [US Patent5,71 6,805, Feb. 10, 1998, issued to Subashini Srinivasan et al. 

20 This soluble CD40L trimer or 'sCD40LT' is the form of that protein now being 
clinically tested in humans for its anti-tumor effects. 

Compounding the difficulties in producing stable forms of soluble TNFSF proteins 
are compromises in bioactivity. As exemplified by FasL, TNF, and CD40L, many of the 
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soluble forms of these proteins lack the full range of stimulatory activities displayed by the 
membrane forms of these molecules. For FasL, several groups have reported that naturally 
produced soluble FasL (generated by proteolytic cleavage from the membrane form) has a 
spectrum of activities that is distinctly different from the membrane form. Soluble FasL 

5 induces apoptosis in activated CD4+ T cells but not fresh, resting CD4+ T cells. In contrast, 
both types of CD4-f- T cells are killed by membrane FasL or a recombinant soluble form of 
FasL (WXl) that spontaneously aggregates into oligomers larger than a decamer. For TNF, 
T cell activation through stimulation of TNFR II, the 80 kDa receptor for TNF, is much 
greater with membrane TNF than soluble TNF. However, if soluble TNF is produced as a 

10 tagged protein and crosslinked with an antibody against the tag, then it completely mimics 
the activities of membrane TNF [Schneider, 1998]. Finally, for CD40L, the stimulatory 
effects of a soluble form of this TNFSF protein are enhanced by crosslinking [Kehry, 1994] 
and yields an activity similar to membrane CD40L. For example, soluble CD40L-CD8 
fusion protein requires crosslinking with a antibody to CDS in order to drive resting B cells 

15 to prohferate to a degree similar to membrane-bound CD40L. ). Even more strikingly, 

although membrane-bound CD40L expressed on baculovirus-transduced SF9 insect cells is a 
strong B cell stimulus, small vesicles (10 - 1,500 nm) prepared from the membranes of these 
cells are less stimulatory. However, ultracentrifugation of these vesicles creates aggregates 
which have the full activity of the original membrane CD40L protein. This indicates that B 

20 cells are more highly stimulated by a large surface of CD40L than they are by a smaller 
surface expressing this membrane Ugand. 

Taken together, the above reports suggest that, for some TNFSF/TNFRSF 
ligand/receptor pairs at least, it is essential to cluster receptors together for fall 
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signaling activity. By this interpretation, the efficacy of the membrane forms of 
FasL, TNF, and CD40L occurs because these hgands can move in the plane of the 
membrane toward the contact zone with a receptor-bearing responding cell, thereby 
clustering ligated receptors to form a receptor-dense region of the membrane. This 
5 interpretation is further supported by experiments where crosslinking of a soluble 
TNFSF protein effectively mimics the activity of the membrane form of the protein 
[Scheider, 1998]. 

In all of the above examples, no more than three amino acid chains have been caused 
10 to associate together. There is a need to produce multimeric protein molecules where more 
than three amino acid chains are caused to associate into a single soluble molecular complex. 
An important example comes from studies of CD40L (also called CD 154 or TNFSF5), 
which is a member of the TNF family of molecules that are normally expressed as insoluble, 
cell membrane proteins. It has been shown that soluble homotrimers composed of the 
1 5 extracellular regions of CD40L, TNF, and FasL are not potently active on resting cells that 
bear receptors for these proteins. However, if these proteins are expressed with a tag on 
their ends (e.g., the FLAG peptide sequence) and then the trimers are extensively crosslinked 
using an antibody to FLAG, full activity appears (Schneider et al, J. Exp. Med, 187:1205, 
1998). From this, it can be inferred that the soluble single-trimer forms of these molecules 
20 does not duphcate the multivalent interactions that normally occur when a receptor-bearing 
cell comes in contact with the membrane of a cell expressing numerous ligand trimers on its 
surface. This distinction may be due to a need for receptor clustering for full signaling 
(Bazzoni and Beutler, K Engl 1 Med, 334:1717, 1996), which in turn is only possible with 
a multimeric Ugand engaging many receptors at the same time in a localized region of the 
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cell membrane. 



SUMMARY OF THE INVENTION 

5 The present invention contemplates a method of preparing soluble, 

multimeric mammahan proteins by culturing a host cell transformed or transfected 
with an expression vector encoding a fusion protein comprising the hub, body, and 
neck region of a coUectin molecule and a heterologous mammalian protein. 

10 In one embodiment, the heterologous mammalian protein comprises an extra 

cellular domain of a mammahan transmembrane protein; the resuUing fusion protein 
forms a multimer. 

In another embodiment, the heterologous mammalian protein comprises a 
15 soluble protein such as a cytokine; the resulting fusion protein forms a multimer. 

In another embodiment, sites of proteolytic degradation are included or 
removed from the fusion protein; the resulting fusion protein forms a multimer from 
which are cleaved single xmits at a rate made variable by the nature of the proteolytic 
20 digestion sites either included or excluded. 

In yet another embodiment, special attention is given to the immunogenicity 
of the fusion protein by altering the junction between the two naturally occurring 
proteins from which it is made; the resulting fusion protein may be less or more able 
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to elicit an immune response against itself, which could lengthen its persistence or 
contribute to it immunological effectiveness. 

A hybrid nucleotide sequence of no more than 1528 base pairs including a sequence 
5 defining a structural gene expressing a conjoined single strand of a multimeric TNFSF-SPD 
fusion protein, said structural gene having a nucleotide base sequence selected from 
members of the group consisting of SEQ ID NO 1, SEQ ID NO 3 and SEQ ID NO 5 is 
disclosed by this invention. In one embodiment, the DNA segment the structural gene has a 
sequence expressing a single hybrid amino acid chain of TNFSF-SPD, the segment having a 
10 first SPD nucleotide base sequence of SEQ ID NO 1, from base 32 to base 799, and a second 
sequence, expressing a portion of TNFSF stalk, selected from members of the group 
consisting of SEQ ID NO 1, from base 800 to base 1444, SEQ ID NO 3, from base 800 to 
base 1528, and SEQ ID NO 5, from base 800 to base 144L 

15 In another embodiment, a recombinant DNA molecule has vector operatively linked 

to an exogenous DNA segment defining a structural gene expressing a single amino acid 
chain of TNFSF-SPD. This structural gene has a nucleotide base sequence selected from 
members of the group consisting of SEQ ID NO 1, SEQ ID NO 3 and SEQ ID NO 5, any 
ftmctional equivalents and modifications thereof There is also attached an appropriate 

20 promoter for driving the expression of said structural gene in a compatible host organism. 
The organism can be E. coU, a yeast, a higher plant or animal. 

Yet another embodiment contemplated by the invention is multimeric TNFSF-SPD 
fusion protein having a plurality of polypeptide trimers, a first trimer consisting of peptide 
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strands of members of the TNF superfamily (TNFSF) of ligands, and a second trimer strand 
from a coUectin molecule, each first trimer conjoined to a second polypeptide trimer strand 
from a coUectin molecule, wherein said hgand strand is substituted for native carbohydrate 
recognition domains (CRD) of the coUectin molecules. The conjoined coUectin strands are 

5 covalently bound in paraUel to each other, forming a multimeric fusion protein comprising a 
plurality of trimeric hybrid polypeptide strands radiating from a covalently bound center hub 
of the molecule. The free end of each trimeric radiating strand has a TNFSF moiety 
attached. The TNFSF moiety is one selected from the group consisting of ligands LTA, 
TNF, LTB, and TNFSF4 to TNFSF 18 as shown in Table 11, and their fimtional equivalents, 

1 0 and modifications thereof 



The invention also contemplates a method for preparing a CD40-SPD multimeric 
ftision polypeptide, including the steps of initiating a culture, in a nutrient medium, of 
procaryotic or eucaryotic host cells transformed with a recombinant DNA molecule 
15 including an expression vector, appropriate for the cells, operatively linked to an exogenous 
DNA segment defining a structural gene for CD40-SPD ligand. The structural gene has a 
nucleotide base sequence of SEQ ID NO 1 from about base 32 to about base 1444. 
Thereafter, the culture is maintained for a time period sufficient for the cells to express the 
multimeric molecule. 

20 

Also contemplated is a method of producing a secreted, very large, biologically 
active, multimeric tumor necrosis factor superfamily hgand fixsion protein chimera that is 
highly immunogenic and not readily diffusable. The steps for this method are as follows: 
1 . introducing into a host cell a first chimeric DNA construct including a 
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transcriptional promoter operatively linked to a first secretory signal sequence, followed 
downstream by, and in proper reading frame with a first DNA sequence encoding a 
polypeptide chain of a first TNFSF ligand requiring multimerization for biological activity. 
This sequence is joined to a second DNA sequence encoding a coUectin polypeptide at the 
5 site where the collectings CRD was purposefiiUy removed. 

2. introducing into the host cell, a second DNA construct including a transcriptional 
promoter operably linked to a second secretory signal sequence followed downstream by, 
and in proper reading fi'ame with, a third DNA sequence encoding a second polypeptide 
chain of a second TNFSF ligand , joined to a fourth DNA sequence encoding a coUectin 

10 polypeptide , wherein the collectings CRD was purposefully removed, and then, 

3. growing the host cell in an appropriate growth medium under physiological 
conditions to allow the secretion of a large multimerized polypeptide fiision protein, wherein 
the first polypeptide chain of a TNFSF-SPD protein is bound by parallel bonding of the 
respective coUectin domain trimer to the second polypeptide chain of a different TNFSF- 

15 SPD polypeptide trimer, and wherein the multimerized polypeptide fiision protein exhibits 
biological activity characteristic of both membrane-attached TNFSFs, and 

4. isolating the biologically active, multimerized TNFSF-SPD polypeptide fiision 
fi*om said host cell. The chimeric reactant compoimds are humanized to guard against 
destruction by a potential human recipient's immune system. 

20 

A final method of preparing a multimeric TNFSF-SPD ligand fiision protein 
contemplated requires a) preparing a first DNA segment coding for a strand of an exposed 
extracellular portion of TNFSF; b) preparing a second DNA segment coding for a coUectin 
polypeptide strand, wherein the coUectin' s CRD domain of the strand has been removed; c) 



10 



conjoining the first and second DNAs in proper reading frame, thereby creating a TNFSF- 
coUectin DNA construct; d) inserting the construct into an expression vector system; e) 
introducing the vector system into an appropriate cell in culture under suitable conditions; f) 
harvesting and purifying spent medium from the culture; and finally g) assaying for presence 
5 of multimeric TNFSF-coUectin ftision protein. 

A method for stimulating the immune response in potentially immonocompetent 
cells using multimeric TNFSF fiision proteins by contacting the cells with the multimeric 
TNFSF fiision proteins, causing the cells to proliferate, is also contemplated. The cells used 
10 may be resting B cells. There is also a method for increasing antigenicity of cells by 

contacting the cells with the multimeric TNFSF fiision proteins. In this case, the cells may 
be tumor cells or HFV positive cells. 

Other preferred embodiments contemplate the methods of preparation 
1 5 described above, wherein the host transformed is either a prokaryote, such as E. cph , 
a eukaryote, for example yeast, such as S. cerevisiae . or a higher plant, such as alfalfa 
or tobacco. 

Still fiirther embodiments and advantages of the invention will become 
20 apparent to those skilled in the art upon reading the entire disclosure contained 
herein. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Structure of the CD40L-SPD fusion protein. The extracellular portion of the 
CD40L homotrimer, including its membrane-proximal stalk, was fused to the body of SPD. 
The N-temiinus of SPD contains two cysteines which link the homopolymer together by 
disulfide bonds forming a hub. The trimeric collagenous stalk extend from the hub as a 
cruciate structure and end in a spontaneously trimerizing neck region. The amino acid 
domains in a single chain of the CD40L-SPD are shown at the top. At the bottom is the 
tetrameric (four CD40L trimers) which is expected to form. In addition, the hub region of 
SPD can participate in stacking up to 8 or more cruciate forms into higher order aggregates. 

Figure 2. Ion-exchange chromatography of murine CD40L-SPD. CHO cells expressing 
murine CD40L-SPD were grown in serum-free media, concentrated using a 100 kDa cutoff 
ultrafiltration membrane, and diafiltered into 50 mM bicine, pH 9.0, 1 mM EDTA. Using an 
FPLC system, the protein from 400 mL of media was applied to a Fractogel S0{ 650M 
column and eluted with a linear salt gradient. 3 mL samples were collected. Shown are 
curves for protein concentration (ODjgo), conductivity as % 1 M NaCl in the buffer, and 
ELISA-detectable CD40L-SPD assayed at 1:100 dilution. 

Figure 3. Size fractionation of murine CD40L-SPD by ultrafiltration. CD40L-SPD is a 
471 amino acid protein with a predicted molecular weight of 49,012 for each of the twelve 
component chains in the dodecamer (composed of four trimeric subunits). This does not 
include added carbohydrates. Therefore, the full dodecamer will have a molecular weight in 
excess of 600,000. However, from the literature on recombinant surfactant protein D made 
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in CHO cells, it appears that some of the product will be in the form of trimers that are not 
part of a cruciate-formed dodecamer. To determine what percentage of CD40L-SPD was 
produced in a multimeric form, supernatant from the transfected CHO cells were passed 
through filters of different porosities (rated for their ability to retard globular proteins). An 
ELISA was used to detect the amount of CD40L-SPD (measured at multiple dilutions) that 
passed through the filter. As shown, about 90% of the protein is retained by a 300,000 kDa 
cut-off filter. This indicates that most of the protein is in the dodecameric form. In addition, 
the cruciate dodecamers of surfactant protein D can also stack on top of each other into even 
higher molecular weight forms. This is the likely explanation for the small fraction of 
CD40L-SPD that is retained by the 1,000 kDa cut-off filter. 

Figure 4. Activation of human B cells by human CD40L-SPD. Conditioned media from 
CHO cells expressing human CD40L-SPD was added to human B cells along with IL-4. hi 
the left panel, the cells were stained with CyChrome-labeled anti-CD 19 to identify B cells 
and PE-labeled anti-CD3 to identify T cells. As shown, most of the cells proliferating in the 
culture were CD19+ CD3- B cells. In the right panel, the cells were stained with 
CyChrome-labeled anti-CD19 to identify B cells and PE-labeled anti-CD80 (B7-1) to 
identify this co-stimulatory molecule. As shown, almost all of the B cells were induced by 
CD40L-SPD to express CD80. 

Figure 5. Activation of murine B ceUs by murine CD40L-SPD. Murine CD40L-SPD 
was added to resting murine splenic B cells for a two day culture period. For the final 4 
hours, the cultures were pulsed with ^H-thymidine, following which the cells were harvested 
and DNA synthesis was measured by scintillation counting. As shown, CD40L-SPD is 
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nearly as effective as anti-IgM in promoting the proliferation of resting B cells. 

Figure 6. CD40L-SPD stimulation of macrophage chemokine production. Conditioned 
media from CHO cells expressing human CD40L-SPD, an inactive mutant of human 
CD40L-SPD (T147N-CD40L-SPD), or murine CD40L-SPD (mCD40L-SPD) were added to 
cultures of human monocyte-derived macrophages. As a negative control, this media was 
heat-inactivated at 60 °C for 30 minutes. Also shown is a form of soluble CD40L (sCD40L) 
consisting of 149 amino acids from the extracellular domain of human CD40L (Peprotech) 
added at 1 ?g/mL. 24 hours later, supematants were collected and assay for MIP-1? by 
ELISA (R&D Systems). The weak activity of soluble single-trimer CD40L (sCD40L) is 
apparent. In contrast, native human and murine CD40L-SPD strongly activated the 
macrophages to produce MIP-1?. In contrast, heat-inactivated CD40L-SPD was inactive. 
As expected, the inactive mutant, T147N-CD40L-SPD, also failed to stimulate 
macrophages, demonsfrating that the CD40L portion and not the SPD portion of the protein 
was responsible for stimulating the macrophages. 

Figure 7. Expression of RANKL/TRANCE-SPD production from CHO ceUs detected 
by ELISA. Antibodies against RANKL/TRANCE were used to construct an ELISA 
capable of detecting the RANKL/TRANCE protein. As shown, there was no background 
with the media confrol. Using a ftision protein between CD70 (CD27L or TNFSF7) and 
SPD, there was also no signal, indicating the specificity of the ELISA. However, using 
CHO cells fransfected with an expression plasmid for CD70-SPD, immunoreactive secreted 
protein was clearly detectable. This demonstrates the generahzability of the method for 
expressing TNFSF members as ftision protems with collectins such as SPD. 
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DESCRIPTION OF THE PREFERRED EMBODIMENT 
1. Definition of Terms 

Multimeric: As used herein the term multimeric refers to a multimer of a 
polypeptide that is itself a trimer (i.e., a plurality of trimers). 

Functional Equivalent: Herein refers to a sequence of a peptide or 
polypeptide that has substantial structural similarity and functional similarity to 

another such sequence. 

Modifications: Herein refers to point changes involving single amino acids, 
wherein the functionality is altered, without appreciably altering the primary 
sequence or primary structure of a peptide or polypeptide. 

Amino Acid: All amino acid residues identified herein are in the natural L- 
configuration. In keeping with standard polypeptide nomenclature, .T. Biol. Chem., 
243:3557-59, (1969), abbreviations for amino acid residues are as shown in the 
following Table of Correspondence: 



15 



TABLE OF CORRESPONDENCE 



SYMBOL AMTNO ACID 





1 -T etter 


3 -Letter 




5 


Y 


Tyr 


L-tyrosine 




G 


Gly 


glycine 




F 

JL 


Phe 


L-phenylalanine 




M 


Met 


L-methionine 


10 


A 


Ala 


L-alanine 




S 


Ser 


L-serine 




L 


He 


L-isoleucine 




L 


Leu 


L-leucine 




T 


Thr 


L-threonine 


15 


V 


Val 


L-valine 




P 


Pro 


L-proline 




K 


Lys 


L-lysine 




H 


His 


L-histidine 




Q 


Gin 


L-glutamine 


20 


E 


Glu 


L-glutamic acid 




w 


Trp 


L-tryptophan 




R 


Arg 


L-arginine 




D 


Asp 


L-aspartic acid 




N 


Asn 


L-asparagin 


25 


C 


Cys 


L-cysteine 



It should be noted that all amino acid residue sequences are represented herein by 
formulae whose left to right orientation is in the conventional direction of amino- 
terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the 
30 beginning or end of an amino acid residue sequence indicates a bond to a radical 
such as H and OH (hydrogen and hydroxyl) at the amino- and carboxy-termini, 
respectively, or a further sequence of one or more amino acid residues up to a total of 



about fifty residues in the polypeptide chain. 

Base Pair (bp): A partnership of adenine (A) with thymine (T), or of cytosine 
(C) with guanine (G) in a double stranded DNA molecule. 

Constitutive promoter: A promoter where the rate of RNA polymerase 
5 binding and initiation is approximately constant and relatively independent of 

external stimuli. Examples of constitutive promoters include the cauHflower mosaic 
virus 35S and 19S promoters described by Poszkowski et al, EMBOl, 3:2719 
(1989) and Odell et al, Nature, 313:810 (1985). 
DNA: Desoxyribonucleic acid. 
10 Enzyme: A protein, polypeptide, peptide RNA molecule, or multimeric 

protein capable of accelerating or producing by catalytic action some change in a 
substrate for which it is often specific. 

Expression vector: A DNA sequence that forms control elements that 
regulate expression of structural genes when operatively linked to those genes. 
15 Expression: The combination of intracellular processes, including 

transcription and translation undergone by a structural gene to produce a polypeptide. 

Insert: A DNA sequence foreign to the rDNA, consisting of a structural 
gene and optionally additional DNA sequences. 

Nucleotide: A monomeric unit of DNA or RNA consisting of a sugar moiety 
20 (pentose), a phosphate, and a nitrogenous heterocyclic base. The base is linked to the 
sugar moiety via the glycosidic carbon {V carbon of the pentose) and that 
combination of base and sugar is a nucleoside. When the nucleoside contains a 
phosphate group bonded to the 3' or 5' position of the pentose it is referred to as a 
nucleotide. 
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Operatively linked or inserted: A structural gene is covalently bonded in 
correct reading frame to another DNA (or RNA as appropriate) segment, such as to 
an expression vector so that the structural gene is under the control of the expression 
vector. 

5 Polypeptide and peptide: A linear series of amino acid residues connected 

one to the other by peptide bonds between the alpha-amino and carboxy groups of 

adjacent residues. 

Promoter: A recognition site on a DNA sequence or group of DNA 
sequences that provide an expression control element for a gene and to which RNA 
10 polymerase specifically binds and initiates RNA synthesis (transcription) of that 
gene. 

Inducible promoter: A promoter where the rate of RNA polymerase 
binding and initiation is modulated by external stimuU. Such stimuH include light, 
heat, anaerobic stress, alteration in nutrient conditions, presence or absence of a 

1 5 metabolite, presence of a ligand, microbial attack, wounding and the Uke. 

Spatially regulated promoter: A promoter where the rate of RNA 
polymerase binding and initiation is modulated in a specific structure of the organism 
such as the leaf, stem or root. Examples of spatially regulated promoters are given in 
Chua et al.. Science, 244:174-181 (1989). 

20 Spatiotemporally regulated promoter: A promoter where the rate of RNA 

polymerase binding and initiation is modulated in a specific structure of the organism 
at a specific time during development. A typical spatiotemporally regulated 
promoter is the EPSP synthase-35S promoter described by Chua et al., Science, 
244:174-181 (1989). 
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Temporally regulated promoter: A promoter where the rate of RNA 
polymerase binding and initiation is modulated at a specific time during 
development. Examples of temporally regulated promoters are given in Chua et al., 
Science, 244:174-181 (1989). 

Protein: A linear series of greater than about 50 amino acid residues 
connected one to the other as in a polypeptide. 

Recombinant DNA molecule: A hybrid DNA sequence comprising at least 
two nucleotide sequences not normally foimd together in nature. 

RNA: Ribonucleic acid. 

Selective Genetic marker: A DNA sequence coding for a phenotypical trait 
by means of which transformed cells can be selected from untransformed cells. 

Structural gene: A DNA sequence that is expressed as a polypeptide, i.e., an 

amino acid residue sequence. 

Synthetic promoter: A promoter that was chemically synthesized rather 
than biologically derived. Usually synthetic promoters incorporate sequence changes 
that optimize the efficiency of RNA polymerase initiation. 

2. Introduction 

This invention discloses the production of TNFSF proteins as multimeric (i.e., many 
trimers) ligands fused onto a trimeric, branched protein backbone. CoUectin molecules are 
ideal for this purpose because they are formed from many trimeric, collagenous arms linked 
to a cenfral hub by disulfide bonds. Of the coUectins, pulmonary surfactant protein D (SPD) 
was chosen initially because it is a homopolymer encoded by a single gene, unlike Clq and 
surfactant protein A, which are composed of two different protein subunits. In addition, 
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recombinant SPD has been successfully expressed in vitro in reasonable yield [Crouch, 
1994], and a peptide containing the "neck" region of SPD was shown to spontaneously 
trimerize in solution [Hoppe, 1994]. Consequently, extracellular domains of human and 
murine CD40L were substituted for the carbohydrate recognition domain of pulmonary 

5 surfactant D (SPD) to create a four-armed molecule (three peptide chains per arm) with 
CD40L at the end of each arm. This molecule is named CD40L-SPD. Li addition, because 
SPD tends to stack into higher order aggregates with up to 8 molecules associated at the hub 
[Crouch], even greater degree of multimerization can occur [Lu, 1993]. CD40L-SPD 
therefore mimics the expression of CD40L by an activated T cell in that it presents a 

10 multivalent complex similar to membrane-bound CD40L. While remaining soluble, 
CD40L-SPD equals membrane CD40L in its range of activities. 

3. ronstruction of eYpressinn plasmid s for rn40L-SPD. 

cDNAs of exposed human and murine CD40L, removed from cell 
15 membranes, were cloned by PCR by well-known methods. Murine surfactant protein 
D was cloned by hemi-nested PCR from murine lung mRNA (Clonetech). cDNA 
was prepared using Superscript II reverse transcriptase (Life Technologies, 
Gaithersburg, MD) and random hexamers as primers. PCR primer sequences were as 
follows (the underlined bases indicate restriction endonuclease sites for cloning into 
20 the vector): 
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inSPD5 : 5'-CTGACATGCTGCCCTTTCTCTCCATGC-3' 
mSPD3ext: 5'-GGAGGCCAGCTGTCCTCCAGCCTGTTTGC-3' 

rmSPDS: S'-GGGQimGCGAATTCCACCAGGAAGCAATCTGACATGCTGCCCTTT- 

CTCTCCATGC-3' 

CD40L/SPD3: 5'-TCTATCTTGTCCAACCTTCTATG/GCCATCAGGGAACAATGCAGCTTTC-3' 
SPD/CD40L5: 5'-AAAGCTGCATTGTTCCCTGATGGC/CATAGAAGGTTGGACAAGATAGAAG-3 
CD40L3 : 5'-GGGCTCGAGGTACCAGTTCTACATGCCTTGGAGTGTATAAT-3' 
SPD/mCD40L5 : 5'-GAAAGCTGCATTGTTCCCTGATGGC/CATAGAAGATTGGATAAGGTCGAAG 
mCD40L/SPD3 : 5'-CTTCGACCTTATCCAATCTTCTATG/GCCATCAGGGAACAATGCAGCTTTC-3 
mCD40L3 : 5'-GGGGGQlAC£CTGCTGCAGCCTAGGACAGCGCAC-3' 

Because the murine SPD sequence of the 5' untranslated region containing 
the ribosomal binding site was unknown when this work was started [Motwani, 
1995], a primer (rmSPDS) was designed based on the available rat sequence 
[Shimizu, 1992] which extended the 5' end with rat sequence (shown in bold) along 
with an added Nhe I site (underlined). 

4. rreation of the CT)40T.-SPD Fusions . 

To create the CD40L-SPD fusions, overlap PGR was used. Murine SPD was 
amphfied by nested PGR using mSPD5 and mSPDSext for the first round of 30 
cycles. The product was diluted 1:1,000 and 1 ?L was amplified for another 30 
cycles using rmSPDS and CD40L/SPD3, where the 3' half of GD40L/SPD3 is a 
reverse primer for SPD G-terminal to the neck region (deleting the GRD) and the 5' 
half of GD40L/SPD3 contains bases firom the N-terminus of the extracellular portion 
of GD40L (immediately adjacent to the transmembrane region). Similarly, the 
GD40L plasmid was amplified with SPD/GD40L5 and GD40L3, which contains a 
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Kpn I site (underlined). All of these PCRs were performed with Pfu cloned 
polymerase (Stratagene,) using hot start (Ampliwax, Perkin-Elmer) and the 
thermocycling program: 94 ""C for 2,5 min; then 30 cycles of 94 ""C for 10 sec, 43 ""C 
for 30 sec, and 75 °C for 7 min. 

5 

To form the chimeric construct, 1 fxL of a 1:1,000 dilution of gel-purified 
products fi-om the above reactions was combined and ampUfied with rmSPD5 and 
CD40L3. Because Pfu polymerase did not consistently yield the expected 1.62 kb 
overlap product, AccuTaq LA DNA polymerase (Sigma) was used for this PGR, 

10 using the thermocychng program: 94 ""C for 2.5 min; then 30 cycles of 98 ""C for 20 
sec, 43 ""C for 30 sec, and 68 ""C for 10 min. The resulting product was digested with 
Nhe I and Kpn I, gel-purified, and Ugated into the Nhe I and Kpn I sites in the 
expression plasmid, pcDNA3.1(+) (Invitrogen, Carlsbad, CA). DH5 E. coli were 
transformed with the construct and plasmid DNA was purified either by double 

15 banding in ethidixmi bromide-CsCl gradients or by anion exchange resin (QIAgen). 
To form the T147N-CD40L-SPD construct, the same approach was used except that 
the CD40L coding region was taken fi:om the expression plasmid for T147N-CD40L 
[Kombluth]. The amino acid sequence at the junction between SPD and CD40L is 
...KAALFPDG/HRRLDKIE..., where the C-terminal portion begins the sequence for 

20 CD40L. To form mCD40L-SPD, a similar approach was taken except that primers 
SPD/mCD40L5, mCD40L/SPD3, and mCD40L3 were used for amplifications 
involving murine CD40L sequences. The amino acid sequence at the junction 
between SPD and murine CD40L is ...KAALFPDG/HRRLDKVE..., where the C- 
terminal portion begins the sequence for murine CD40L. Both DNA strands of each 

25 construct were sequenced to confirm that the constructs were correct. In other 

experiments, an entirely humanized construct, consisting of human CD40L fiised to 
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human SPD, was constructed (data not shown). 



5. Construction of expression plasmid for murine RANKL/TRANCE (TNFSF 11). 

Spleen cells from C3H/HeJ mice were stimulated with 5 p-g/ml concanavalin A and 10 
5 ng/ml IL-2 (Sigma) for 8 hours (31). mRNA was isolated using the Micro FastTrack kit 
(Invitrogen). cDNA was prepared using Superscript n reverse transcriptase (Life 
Technologies) and random hexamers as primers. PGR primer sequences were as follows 
(where the imderlined bases indicate restriction endonuclease sites for cloning into the 
vector): 

10 5mRANKL-ext: 5'-CATGTTCCTGGCCCTCCTC-3' 

3mRANKL-ext: 5'-GTACAGGCTCAAGAGAGAGGGC-3' 

5mRANKL-int: 5'-AT ACTCGAG CGCAGATGGATCCTAAC-3' 

3mRANKL-mt: 5'-GGGGTTTAGCGGCCGCTAATGTTCCACGAAATGAGTTC-3' 

15 The extracellular portion of RANKL/TRANCE was cloned by nested PCR. In the 

first round of PCR, 5mRANKL-ext and SmRANKL-ext were used with Pfii cloned 
polymerase (Stragene) usmg the thermocycling program: 94 °C for 2.5 min; then 30 cycles 
of 94 °C for 10 sec, 50 °C for 30 sec, and 75 °C for 2 min. The product was diluted 1:1,000 
and 1 ^L was amplified for another 30 cycles using 5mRANKL-int and 3mRANK-int, 

20 which contain an Xho I site and a Not I site respectively. The resulting product was digested 
with Xho I, blunt-ended with T4 DNA polymerase, then digested with Not I and gel- 
purified. The CD40L-SPD expression plasmid described above was digested with Msc I and 
Not I and gel purified. Then the RANKL/TRANCE sequence was Ugated into this vector in 
fi:ame with the SPD coding sequence. The amino acid sequence at the junction between 

25 SPD and RANKL/TRANCE is ...KAALFPDG/RAQMDPNR..., where the N-terminal 
portion is from SPD and the C-terminal portion is the extracellular sequence of 
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RANKL/TRANCE. Both DNA strands of each construct were sequenced to confirm that 
the constructs were correct. 

6. Stable transfection of DHFR-deficient CHO cells and amplification, 

5 DG44 (a line of CHO-Kl cells deficient in dihydrofolate reductase (DHFR)) (32) and 

pCHlP (a plasmid containing the hamster DHFR minigene) (33) were gifts fi-om Dr. 
Lawrence Chasin, Columbia University, New York, NY . DG44 cells were cultured in a- 
MEM consisting of ribo- and deoxynucleoside-free a-MEM (BioWhittaker, Walkersville, 
MD) supplemented with 200 [iM L-glutamine, 10% fetal bovine serum (FBS) and 10 ^g/ml 

10 each of adenosine, deoxyadenosine, and thymidine (Sigma). All cell cultures described 
were negative in a mycoplasma rRNA assay (Gen-Probe, San Diego). DG44 cells in six- 
well plates were transfected by the method of Okayama and Chen ((34) with 10 |j.g of 
expression plasmid and 0.05 |ag of pCHlP (200:1 ratio). After two days, the transfected 
DG44 were trypsinized and transferred to 100 mm plates. At this point, the media was 

15 switched to a"-MEM which differs fi-om a-MEM in that dialyzed FBS (HyClone Systems, 
Logan, UT) was used and no nucleoside supplements were added. Only cells containing the 
DHFR minigene were able to grow in a'-MEM, and colonies were selected after 10 days, 
cloned using cloning rings, and transferred to 12.5 cm flasks. Clones were selected for 
expansion using an ELISA to screen for the production of either murine or human CD40L 

20 (see below). Using the method described by Kingston et al (35), escalating doses of 

methotrexate were used to amplify the transfected genes over a period of 6-14 months until 
the cells grew well in 80 jiM methotrexate. Each expressing clone was re-cloned once or 
twice more in order to select the highest expressing cells. 

25 7, Preparation of human and murine CD40L-SPD in serum-free media . 

Selected clones were adapted for growth in nucleoside-free UltraCHO media 
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(BioWhittaker) supplemented with 50-100 \xg/mL ascorbic acid and 50 \xM 
methotrexate (Sigma). The non-adherent population was further adapted for 
suspension growth in roller bottles. In some experiments, the cells were adapted 
from a'-MEM to CHO-S-SFM II media (Life Technologies) supplemented with 
5 ascorbic acid and 50 fig/mL L-proline. 

8. ELISA assay for human and murine CD40L-SPD . 

To assay for correctly folded CD40L, wells of a MaxiSorb 96-well plate 
(Nunc) were coated overnight at 4 "^C with 50 ?L of carbonate-bicarbonate, pH 9.40 

10 buffer containing 0.5 ^ig/mL 24-3 1 anti-human CD40L MAb (Ancell) or MRl anti- 
murine MAb (Bioexpress, Lebanon, NH). Wells were blocked with 3% bovine 
serum albumin (BSA) in PBS. 100 |liL samples were added to the wells either neat 
or diluted in a dilution buffer consisting of 1% BSA, 0.9 % NaCl, 50 mM Tris pH 
7.40, and 0.1% peroxide-free Tween 20 (Sigma). After shaking for 2 h at 600 RPM, 

15 a plate washer was used to wash the plate four times with 0.9 % NaCl, 50 mM Tris 
pH 7.40, and 0.1% peroxide-free Tween 20. Then, 100 nL of diluent buffer 
containing 1 [ig/mL biotinylated 24-31 anti-hiunan CD40L Mab (Ancell) or MRl 
anti-murine CD40L Mab (Pharmingen, San Diego, CA) was added to each well and 
again shaken for 2 h. Following another four washes, 100 \iL of diluent buffer 

20 containing 1 jLig/mL of streptavidin-alkaline phosphatase (Jackson) was added to 
each well and the plate was shaken for 1 hour. Lastly, after another four washes, 
color was developed for 10-20 min using 100 jiL/well of BluePhos (Kierkegaard & 
Perry), stop solution was added, and the wells were read at 650 |xm in a plate reader. 

9. Purification of human and murine CD4QL-SPD, 

25 Conditioned UltraCHO media was filtered using a 0.2 \i PES filter unit 

(Nalgene) and stored at 4 for up to 3 months. A preliminary size fractionation 
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was performed by ultrafiltration through a 100 kDa-cutoff 76 mm membrane (YM- 
100, Millipore) in a 400 mL stirred cell at 10 Ibs/sq. inch pressure of argon. Media 
was concentrated to about 10 mL, diluted to 100 mL with buffer, and again 
concentrated to 10 mL for a total of 3 cycles of ultrafiltration and buffer exchange. 
5 Buffer was 50 mM Bicine (Calbiochem), adjusted to pH 9.0 with NaOH (about 32 
mM Na), and 1 mM EDTA to prevent the activity of any metalloproteinase. Using 
FPLC equipment (Amersham-Pharmacia), the concentrate was filtered through a 
0.45 |i filter, placed into a 10 mL superloop, appUed to a 10 X 30 mm column 
(HRlO/30, Amersham-Pharmacia) packed with Fractogel S0{ 650M (EM 

10 Biosciences), and eluted at 0.5 mL/min at 4 with a linear gradient of 0-500 mM 
NaCl in buffer. As described by the manufacturer, the resolution of proteins on 
Fractogel SO3" is enhanced by using a long, thin column geometry. Fractions were 
collected and screened for human or murine CD40L by ELISA. Positive firactions 
were pooled, concentrated by ultrafiltration (CentriPrep-30, Millipore), filtered 

15 through a 0.45 \x filter, and applied to a Superose 6 column (Amersham-Pharmacia) 
in phosphate-buffered saline. 

10, Murine B cell cultures. 

C3H/HeJ mice were euthanized by CO2 inhalation under a protocol approved by the 
20 Animal Subjects Committee of the San Diego VA Healthcare System. Splenocytes were 
isolated by centrifiagation over Lympholyte-M (Accurate Chemical & Scientific Corp.. 
Westbury> NY ) and B cells were isolated by negative selection using anti-CD43 
immunomagnetic beads (Miltenyi Biotec Inc.. Auburn. CA ). The resting B cells were 
suspended in Dulbecco's MEM with 10% FBS at a concentration of 1 X 10^/mL, and 100 
25 jiiL was added to the wells of 96-well flat-bottomed plates. 100 ^iL of dilutions of murine 
CD40L-SPD in media or media alone were added to the wells, which were incubated in 
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8.5% C02 at 37 ""C for 48 hours. Then, 0.5 ^Ci/well of ^H-thymidine was added to each 
well, and the cells were collected 4 h later onto glass fiber filters using an automated cell 
harvester. A scintillation counter was used to determine the incorporated radioactivity. 

5 11. Human B cell cultures . 

Venous blood from consenting subjects was used as a source of human B cells under 
a protocol approved by the UCSD Institutional Review Board. Blood was collected into 
syringes containing 5 U/mL heparin and peripheral blood mononuclear cells (PBMC) were 
isolated by centrifugation over FicoU-hypaque. The cells were suspended at 2 X 10^/mL in 
10 RPMI 1640 containing 200 \xM L-glutamine, 10% FBS, 0.832 cyclosporin A (Sigma), 
and 25 ng/mL human IL-4 (R&D Systems) and incubated in 5% CO2 at 37 as described 
by Schultze et al (36). At intervals, the cells were stained with Cy Chrome-conjugated anti- 
CD19 and PE-conjugated anti-CD80 (B7-1) monoclonal antibodies (Pharmingen) and 
analyzed by flow cytometry. 

15 

12. Human monocyte-derived macrophage and dendritic cell cultures. 

As previously described [Kombluth], monocytes were isolated from PBMC 
by adherence to fibronectin-coated plates, plated into 48-well plates, and then 
cultured in RPMI1640 containing 200 ?M L-glutamine and 10% autologous serum 

20 for 7-10 days. Monolayers of the matured cells (about 2 X 10^/well), termed 

monocyte-derived macrophages or MDM, were then washed in media and cultured in 
1 mL/well RPMI1640 containing 200 ?M L-glutamine and 10% heat-inactivated 
FBS. Alternatively, dendritic cells (DC) were formed from monocytes by adding 
GM-CSF and IL-4 to the culture media, and the resulting DC were used 6 days later. 

25 Preparations of CD40L-SPD were added to the wells as indicated. As a positive 
control, 100 ng/mL bacterial lipopolysaccharide (LPS) from E. coU 01 1 1 :B4 
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(Calbiochem) was added. Supematants were collected 24 h later and analyzed for 
cytokine content using ELISA (R&D Systems). 

Example 1. 

5 Design principles in constructing coUectin-TNFSF member fusion proteins. 

To express CD40L and other TNFSF members as stable, multimeric proteins, 
the coding region of the extracellular, C-terminal portion of CD40L was joined in- 
frame to the coUectin, surfactant protein D (SPD). The N-terminus of SPD contains 
two cysteines which form the disulfide bonds necessary for the 4-armed cruciate 

10 structure of the overall molecule [Brown-Augsburger, 1996 #506]. C-terminal to 
these cysteines in SPD is a long triple-heUcal collagenous "stalk" which ends in the 
"neck" region that promotes the trimerization of each arm of the structure. 
Immediately after this neck region, the coding sequence for the extracellular portion 
of CD40L was added, in place of the carbohydrate recognition domain (CRD) of 

15 SPD. The coUectins were chosen as the framework for the multimeric construct 
because of their multi-subunit structure and the trimeric nature of their stalk regions. 
Appropriateness of replacing the CRD of a coUectin with the extracellular region of a 
TNFSF member is fiirther supported by structural studies of the two protein families. 
An analysis of the CRD crystal structure of another coUectin, ACRP30, indicated 

20 that it was structurally superimposable upon the crystal structures of the extracellular 
regions of CD40L, TNF, and Fas [Shapiro, 1998]. The successful expression of the 
coUectin-TNFSF fusion protein, CD40L-SPD, indicates that other TNFSF members 
(Table I) could be conjoined to SPD in a similar manner and that other coUectins 
besides SPD (Table II) could be used as a protein framework instead of SPD. 

25 Because these molecules are formed entirely from naturally occurring proteins, the 
production of an immune response (e.g., antibodies) to these ftision proteins is 
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minimized. By deleting portions of the stalk region of the TNFSF proteins, 
additional constructs can be made which may be even less immunogenic. 



Example 2. 

5 Expression of human and murine CD40L-SPD in CHO cells. 

The coding regions for the extracellular portion of human CD40L, human 
T147N-CD40L ,an inactive mutant of CD40L, or murine CD40L were joined to the 
neck region of murine SPD, replacing the SPD CRD (Fig. 1), A CMV-driven 
expression plasmid for the construct was co-transfected with a DHFR minigene into 
10 DNFR-deficient CHO cells. Following selection in nucleoside-free media, 
expressing CHO clones were amplified by culture in ascending doses of 
methotrexate. The resulting clones produced about 1-10 p,g/mL of the fusion protein 
over a 3 day period in media containing FBS. 

15 Clones were adapted for growth as suspension cells in two types of serum- 

fi-ee media. Murine CHO-SPD produced in UltraCHO (BioWhittaker) was largely 
retained (about 60% as determined by ELISA) by a 1,000 kDa cutoff ultrafiltration 
membrane (Pall Corp., Port Washington, NY), consistent with a large multimeric 
complex formed by the stacking of the SPD portion of the molecule. However, in 

20 CHO-S-SFM II (Life Technologies), nearly all ELISA-detectable murine CHO-SPD 
passed through a 100 kDa cutoff ultrafiltration membrane (Millipore), suggesting 
that the protein was either folding incorrectly in this media or was being degraded by 
proteolysis. Consequently, the purification method was optimized for the spent 
UltraCHO media. 

25 
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Example 3, 

Purification of human and murine CD40L-SPD. 

Purification procedures were developed for murine CD40L-SPD, but the 
same methods could be appUed to human CD40L-SPD with minor modifications. 
5 Murine CD40L-SPD has a predicted m.w. of 49 kDa per chain, or about 600 kDa per 
12-chain, cruciate molecule, the amino acid sequence predicts a pi of 9.10. 
Accordingly, conditioned media was concentrated by ultrafiltration through a 100 
kDa cutoff filter, which also fi*actionates the sample on a size basis. After 
diafiltration into 50 mM bicine, pH 9.00 (also containing 1 mM EDTA added to 

10 inhibit metalloproteinases), the sample was appHed to a variety of cationic exchange 
resins. Using Source 308 (Amersham-Pharmacia), most of the ELISA-detectable 
protein did not bind and was recovered in the flow-through. However, as reported by 
Morris et al. {Morris}, Fractogel SO3 65 OM retained the protein. The retention by 
this tentacular resin and not by Source 308 suggests binding to positively charged 

15 residues that are not on the protein surface. Using a linear NaCl gradient, ELISA- 
detectable protein elutes at between 0.15-0.30 M NaCl under these conditions (Fig. 
2). hi selected experiments, the protein was fiirther purified using a Superose 6 
sizing column. Most of the ELISA-detectable protein eluted in the excluded volume, 
indicating an apparent m.w. of greater than 1,000 kDa (Fig. 3). 

20 

Example 4. 

Activity of CD40L-SPD on human B cells, 

Schultze et al described a system using CD40L-expressing cells plus IL-4 and 
cyclosporin A (to inhibit T cell growth) as a means to grow very large numbers of B cells 
25 from a small sample of blood. Because CD40L activates these B cells to express high levels 
of B7 molecules (CD80 and CD86), the proliferating B cells were effective in presenting 
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peptide antigens and rival non-dividing dendritic cells as antigen-presenting cells (APCs) 
(36). To determine if the CD40L-SPD fusion protein could replace CD40L-expressing cells 
in this system, PBMC were cultured with CD40L-SPD in addition to IL-4 and cyclosporin 
A. Under these conditions the cells grew to saturation density every three days. After three 
5 weeks, the cultures were almost entirely CD19+ B cells which express high levels of CD80 
(Fig. 4), This indicates that CD40L-SPD can be used in ex vivo systems where a soluble yet 
effective form of CD40L is needed to stimulate cells for immunotherapeutic appUcations. 

Example 5. 

10 Activity of CD40L-SPD on murine B cells. 

Resting murine B cells are particularly difficult to stimulate with most 
soluble forms of CD40L. Even with murine CD40L-CD8 fusion proteins, it is 
necessary to crosslink the protein with antibodies against CDS in order to achieve 
maximal proUferation in culture [Klauss, 1999]. Accordingly, resting murine B cells 
15 were negatively selected with immunomagnetic beads. As shown in Fig. 5, murine 
CD40L-SPD was as effective as anti-IgM antibody in driving B cells to proliferate. 
This indicates that CD40L-SPD can mimic the multivalent interactions that occur 
when a responding cell comes in contact with CD40L-bearing activating cells. 

20 Example 6. 

Activity of CD40L-SPD on human macrophages and dendritic cells. 

CD40L is a powerful stimulant for macrophages (reviewed in (28)) and dendritic 
cells (40). Accordingly, preparations of CD40L-SPD were added to monocyte-derived 
macrophages and the production of MIP-1^ was used as a measure of stimulation. As 
25 shown in Fig. 6, both human and murine CD40L-SPD were able to stimulate macrophages, 
whereas the T147N-CD40L-SPD mutant was inactive as expected. 
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DISCUSSION 

These examples define a new method of producing multimeric (i.e., many 
trimers) of CD40L as a fusion protein with SPD. Also prepared and expressed were 
5 similar fusion proteins between murine RANKL/TRANCE (TNFSFl 1) or murine 
CD27L/CD70 (TNFSF7) joined to murine SPD (data not shown). This suggests that 
virtually all TNFSF members could be successfully produced as fusion proteins with 
SPD. Furthermore, it is also likely that other coUectins besides SPD could be used in 
these fusions, given the strong structural homologies between the CRDs of the 

10 coUectins and the extracellular domains of TNFSF members [Shapiro] which can be 
substituted for these CRDs. Given the 17 known TNFSF members and 9 known 
coUectins, at least 153 fusion protein combinations are possible. 

SPD was selected for initially because it is a soluble homopolymer. Other 
coUectins, such as surfactant protein A, have strong binding affinities to lipids and 

15 specific cell receptors. Although removal of the CRD abrogates much of this 

binding, it may be partially mediated by the neck region sequence, which the fusion 
proteins retain. Accordingly, it would be expected that coUectins other than SPD 
might confer different cell-binding and pharmacokinetic behaviors upon a fusion 
protein. For example, macrophages are known to take up and degrade whole SPD 

20 [Dong, 1998]. If a fusion protein other than SPD were used, the disposition of the 
fusion protein in vivo might be altered. Additionally, metalloproteinases are known 
to degrade the coUectin, Clq , so that a fusion with Clq may alter the degradation of 
the fusion protein. For example, because CD40L activates macrophages and other 
cells to produce metaUoproteinases, which could potentially degrade the coUagenous 

25 portion of SPD and other coUectins. Cleavage of the collagenous stalk would then 
be expected to release single-trimers of CD40L, which could diffuse away from the 
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original parent molecule, much like a slow-release formulation of a drug. Also, the 
membrane-proximal portion of CD40L has been retained in CD40L-SPD. This 
sequence also contains protease-susceptible amino acid sequences, which can be 
eliminated by mutagenesis to retard the cleavage of CD40L from the fusion protein. 
Mutations in such proteinase cleavage site(s) would delay such cleavage and favor 
the local persistence of the CD40L stimulus. 

CD40L-SPD is a large macromolecule (> 1,000 kDa), and the other TNFSF- 
collectin fusion proteins would be expected to be similarly large. For native SPD, 
the aggregates that spontaneously form measure 100 nm in diameter. When injected 
into tissue, this large a complex would be expected to remain at the injection site for 
a prolonged period. LocaUzation of the TNFSF-containing protein would also be 
expected to reduce any systemic toxicity caused by the release of free single-trimers 
into the circulation. For example, soluble CD40L in blood has been linked to disease 
activity in lupus, and this smaller molecule may even cross the glomerulus to cause 
damage to renal tubules [Kato and Kipps, J. Clin. Invest. Nov. 1999]. On the other 
hand, because CD40L induces the production of chemokines which attract immune 
cells [Kombluth], T cells, monocytes, and dendritic cells would be expected migrate 
to the site where CD40L-SPD was injected. This might be advantageous if CD40L- 
SPD were used as a vaccine adjuvant. In mice, soluble CD40L (sCD40LT) 
stimulates IgGl production but not cytotoxic T lymphocytes (CTLs) [Wong, 1999]. 
Interestingly, the same protein that is expressed from an injected plasmid stimulates 
both a strong antibody and CTL response [Gurunathan, 1998]. In the latter case, the 
plasmid would be expected to deliver a localized supply of CD40L, whereas the 
SCD40LT protein is free to diffuse away. Support for the locahzed use of CD40L in 
an adjuvant formulation is provided by a study using a plasmid expressing full-length 



33 



membrane CD40L, which was very effective in stimulating both humoral and CTL 
immune responses [Mendoza, 1997]. Similarly, injection of adenovirus expressing 
membrane CD40L has potent antitumor activity in mice [Kikuchi, 1999]. Similar 
considerations would likely apply to other fusion proteins between the TNFSF and 
collectins. 

Finally, for immunostimulatory proteins, it is particularly important that the 
protein not be antigenic if repeated injections are needed. For example, vaccination 
with TNF-|a modified by the addition of short peptide sequences was able to induce 
the production of disease-modifying anti-TNF-]i autoantibodies [Dalum, 1999]. 
Because CD40L-SPD and other TNFSF-collectin fusion proteins are formed from 
endogenous protein sequences (with the possible exception of the peptide sequence 
at the junction), the production of antibodies might not limit the effectiveness of 
repeated injections. 

In conclusion, fusions between TNFSF members and collectins offer a novel 
means of generating large protein complexes which can provide localized stimulation 
at an injection site. Because of the multimeric nature of the coUectin backbone, such 
fusion proteins may mimic the multivalent ligand surface presented by the membrane 
forms of TNFSF members to TNFRSF-bearing responding cells. Moreover, by 
Umiting systemic toxicity while maintaining locahzed efficacy, such fusion proteins 
may have a role as vaccine adjuvants against infectious agents and tumors. 
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New Ligand Symbol 

LTA 

TNF 

LTB 

TNFSF4 

TNFSF5 

TNFSF6 

TNFSF7 

TNFSF8 

TNFSF9 

TNFSFIO 

TNFSFll 

TNFSF12 

TNFSF13 

TNFSF13B 

TNFSF14 

TNFSF15 

TNFSF16 

TNFSF17 



Table I 

Ligands of the TNF Superfamily* 

Other Names 

Lymphotoxin-, TNF-?, TNFSFl 
TNF-?, TOFSF2 
Lymphotoxin-, TNFSF3 
OX-40L 

CD40L, CD 154. Gp39, T-BAM 
FasL 

CD27L, CD70 

CD30L 

4-lBBL 

TRAIL, Apo-2L 

RANKL, TRANCE, OPGL, ODF 

TWEAK, Apo-3L 

APRIL 

BAFF, THANK, BLYS 
LIGHT, HVEM-L 
VEGI 

unidentified 
unidentified 
AITRL, GITRL 



Genbank ID 

X01393 

X02910 

L11016 

D90224 

X67878 

U11821 

L08096 

L09753 

U03398 

U37518 

AF013171 

AF030099 

NM_003808 

AF136293 

AF036581 

AF039390 



AF125303 



TNFSFl 8 

*(asofNov. 1, 1999) 

Known members of ligands in the TNF superfamily, taken firom the Human Gene 
Nomenclature Committee at http://www.gene.ucl.ac.uk/users/hester/tnftop.htm 
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Table II 



The CoUectin Superfamily 



Clq 

Mannose-binding protein, 



Pulmonary surfactant 

protein D 

conglutinin 

collectin-43 

CL-Ll 

ACRP30 

Hib27 



MBLl 



Mannose-binding protein. 



MBL2 



Pulmonary surfactant 
protein A 



All coUectins are formed as multimers of trimeric subunits, each containing a 
collagenous domain. The C-terminus of each coUectin contains a CRD which binds 
carbohydrates and other Ugands. Because of the tight similarities between the known 
CRD structures and the extracellular domains of TNFSF members, it is likely that the 
CRD of any coUectin could be replaced with the extracellular domain of any TNFSF 
member in a structurally compatible manner. 

While the present invention has now been described in terms of certain 
preferred embodiments, and exemphfied with respect thereto, one skilled in the art 
will readily appreciate that various modifications, changes, omissions and 
substitutions may be made without departing from the spirit thereof. It is intended, 
therefore, that the present invention be limited solely by the scope of the following 
claims. 
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CLAIMS 



What is claimed is: 

1 . A hybrid nucleotide sequence of no more than 1 528 base pairs including a sequence 
defining a structural gene expressing a conjoined single strand of a multimeric TNFSF-SPD 
fusion protein, said structural gene having a nucleotide base sequence selected from 
members of the group consisting of SEQ ID NO 1, SEQ ID NO 3 and SEQ ID NO 5. 

2. The DNA segment according to claim 1, wherein the structural gene comprises: 

a segment expressing a single hybrid amino acid chain of TNFSF-SPD, said segment 
having 

a first SPD nucleotide base sequence of SEQ ID NO 1, from base 32 to base 
799, and 

a second sequence, expressing a portion of TNFSF stalk, selected from 
members of the group consisting of SEQ ID NO 1, from base 800 to base 1444, SEQ 
ID NO 3, from base 800 to base 1528, and SEQ ID NO 5, from base 800 to base 
1441. 

3. A recombinant DNA molecule comprising a vector operatively linked to an 
exogenous DNA segment defining a structural gene expressing a single amino acid chain of 
TNFSF-SPD, said structural gene having 

a nucleotide base sequence selected from members of the group consisting of SEQ ID 
NO 1, SEQ ID NO 3 and SEQ ID NO 5, any functional equivalents and modifications 
thereof, and 



37 



an appropriate promoter for driving the expression of said structural gene in a 
compatible host organism. 



4. The recombinant DNA molecule as described in claim 3 wherein said host organism 
5 is E. coli. 



5. The recombinant DNA molecule as described in claim 3, wherein said host 
organism is a yeast. 
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6, The recombinant DNA molecule as described in claim 3, wherein said host 



organism is a higher plant or animal. 



7. A multimeric TNFSF-SPD fusion protein comprising: 
a plurality of polypeptide trimers, 



i) a first trimer consisting of peptide strands of members of the 
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TNF superfamily (TNFSF) of ligands, and 
ii) a second trimer strand from a coUectin molecule 



each first trimer conjoined to a second polypeptide trimer strand from 



20 



a coUectin molecule, wherein said ligand strand is substituted for native 
carbohydrate recognition domains (CRD) of the coUectin molecules, 



25 



said conjoined coUectin strands covalently bound in parallel to each 
other, therein forming a multimeric flision protein comprising a plurality of 
trimeric hybrid polypeptide strands radiating fi-om a covalently bound center 
hub of the molecule, the free end of each trimeric radiating strand having a 
TNFSF moiety attached. 
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8. The multimeric fusion protein according to claim 6, wherein the TNFSF moiety is 
one selected from the group consisting of Ugands LTA, TNF, LTB, and TNFSF4 to TNFSF 
1 8 as in Table II, their funtional equivalents, and modifications thereof. 



5 9. A method for preparing a CD40-SPD multimeric fusion polypeptide, comprising the 

steps of: 

initiating a culture, in a nutrient medium, of procaryotic or eucaryotic host cells 
transformed with a recombinant DNA molecule including an expression vector, appropriate 
for said cells, operatively linked to an exogenous DNA segment defining a structural gene 
10 for CD40-SPD ligand, said structural gene having a nucleotide base sequence of SEQ ID NO 
1 from about base 32 to about base 1444; and 

maintaining said culture for a time period sufficient for said cells to express said 
multimeric molecule. 



15 10. A method of producing a secreted, large, biologically active, multimeric tumor 

necrosis factor superfamily ligand fusion protein chimera that is highly immunogenic and 
not readily diffusable, comprising: 

introducing into a host cell a first chimeric DNA construct including 

a transcriptional promoter operatively linked to a first secretory signal 
20 sequence, followed downstream by, and in proper reading frame with 

a first DNA sequence encoding a polypeptide chain of a first 
TNFSF ligand requiring multimerization for biological activity, 
joined to 

a second DNA sequence encoding a coUectin polypeptide at 
25 the site where the coUectin' s CRD was purposefully removed, 

introducing into said host cell a second DNA construct including 
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a transcriptional promoter operably linked to a second 
secretory signal sequence followed downstream by, and in proper 
reading frame with, 

a third DNA sequence encoding a second polypeptide chain of 
a second TNFSF Ugand , joined to 

a fourth DNA sequence encoding a coUectin polypeptide , 
wherein the coUectin' s CRD was purposefully removed; 
growing said host cell in an appropriate growth medium under physiological 
conditions to allow the secretion of a large multimerized polypeptide fusion protein, wherein 
said first polypeptide chain of a TNFSF-SPD protein is bound by parallel bonding of 
the respective coUectin domain trimer to 

said second polypeptide chain of a different TNFSF-SPD polypeptide trimer, and 
wherein said multimerized polypeptide fusion protein exhibits biological activity 
characteristic of both membrane-attached TNFSFs; and 

isolating said biologically active, multimerized TNFSF-SPD polypeptide fusion from 
said host cell. 

11. The method according to claim 10, wherein the chimeric reactant compounds are 
humanized to guard against destruction by a potential human recipient's immune system. 

12. A method of preparing a multimeric TNFSF-SPD Ugand fusion protein, comprising: 

preparing a first DNA segment coding for a strand of an exposed extracellular 
portion of TNFSF; 

preparing a second DNA segment coding for a coUectin polypeptide strand, 
wherein said coUectin' s CRD domain of the strand has been removed; 

conjoining said first and second DNAs in proper reading frame, therein creating 
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a TNFSF-coUectin DNA construct; 

inserting said construct into an expression vector system; 

introducing said vector system into an appropriate cell in culture under suitable 
conditions; 

harvesting and purifying spent medium from said culture; and 
assaying for presence of multimeric TNFSF-coUectin fusion protein. 

13. A method for stimulating the immune response in potentially immonocompetent 
cells using multimeric TNFSF fusion proteins, comprising: 

contacting said cells with said multimeric TNFSF fusion proteins, wherein said cells 
are induced to proliferate. 

14. The method according to claim 15, wherein the cells are resting B cells. 

15. A method for increasing antigenicity of cells, comprising: 

contacting said cells with said multimeric TNFSF fusion proteins, wherein said cells are 
tiunor cells or HIV positive cells. 
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ABSTRACT 



A method for constructing stable bioactive fusion proteins of the difficult to express 
tumor necrosis factor superfamily (TNFSF), and particularly members CD40L (CD 154) and 
RANKL/TRANCE, with coUectins, particularly pulmonary surfactant protein D (SPD) is 
described. Single trimers of these proteins lack the full stimulatory efficacy of the natural 
membrane forms of these proteins in many cases. The multimeric nature of these soluble 
fusion proteins enables them to engage multiple receptors on the responding cells, thereby, 
mimicking the effects of the membrane forms of these hgands. For CD40L-SPD, the 
resulting protein stimulates B cells, macrophages, and dendritic cells, indicating its potential 
usefulness as a vaccine adjuvant. The large size of these fusion proteins makes them less 
likely to diffuse into the circulation, thereby limiting their potential systemic toxicity. This 
property may be especially useful when these proteins are injected locally as a vaccine 
adjuvant or tumor immunotherapy agent to prevent them from diffusing away. In addition, 
these and other TNFSF-collectin fusion proteins present new possibilities for the expression 
of highly active, multimeric, soluble TNFSF members. 
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Purification of murine 
CD40L-SPD by cationic 
exciiange cliromatograpiiy 
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SEQUENCE LISTING 



<110> Kornbluth, Richard S 

<120> Multimeric forms of CD40L and other TNF family members 
<130> TNFSF-collectin fusion proteins 

<140> 60/111,471 
<141> 1998-12-08 

<160> 6 

<170> Patentin Ver, 2.1 

<210> 1 ' 
<211> 1552 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> 5'UTR 
<222> (7) . . (31) 

<220> 

<221> misc_feature 
<222> (88) . . (799) 

<2 23> Mature murine surfactant protein D including hub 
region, collagenous portion, and neck, but 
excluding carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (801) . . (1546) 

<223> Human CD40 ligand extracellular region, including 
stalk. 

<220> 

<221> sig_peptide 
<222> (32) . . (88) 

<223> Signal peptide from murine surfactant protein D 

<220> 

<221> CDS 

<222> (32) . . (1444) 

<220> 

<223> Description of Artificial Sequence: Murine 
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surfactant protein D (without the CRD) fused to 
the extracellular portion of human CD40L 

<300> 

<301> Spriggs, Melanie K. 

Armitage, Richard J. 

Strockbine, L 

Clifford, K N. 

Macduff, B M. 

Sato, T A. 

Maliszewski, C R. 

Fanslow, William C. 
<3 02> Recombinant human CD40 ligand stimulates B cell 

proliferation and immunoglobulin E secretion. 
<303> J. Exp. Med. 
<304> 176 
<305> 6 

<306> 1543-1550 

<307> 1992 

<313> 801 TO 1600 

<300> 

<3 01> Motwani, M 
White, R A. 

<302> Mouse surfactant protein-D. cDNA cloning, 

characterization, and gene localization to chromosome 
14, 

<3 03> J. Immunol. 
<304> 155 
<305> 12 
<306> 5671-5677 
<307> 1995 
<313> 32 TO 800 

<400> 1 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 52 

Met Leu Pro Phe Leu Ser Met 
1 5 

ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 100 
Leu Val Leu Leu Val Gin Pro Leu Gly Asn Leu Gly Ala Glu Met Lys 
10 15 20 

age etc teg cag aga tea gta cec aac acc tge acc eta gtc atg tgt 148 
Ser Leu Ser Gin Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 
25 30 35 

age cea aea gag aat gge etg cct ggt egt gat gga egg gat ggg aga 19 6 
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Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 
40 45 50 55 



gaa ggt cca egg ggt gag aag ggt gat cca ggt ttg cca gga cct atg 244 
Glu Gly Pro Arg Gly Glu Lys Gly Asp Pro Gly Leu Pro Gly Pro Met 

60 65 70 

ggg etc tea ggg ttg cag ggc cct aca ggt cca gtt gga ccc aaa gga 2 92 
Gly Leu Ser Gly Leu Gin Gly Pro Thr Gly Pro Val Gly Pro Lys Gly 
75 80 85 

gag aat ggc tct get ggc gaa cct gga cca aag gga gaa cgt gga eta 3 40 
Glu Asn Gly Ser Ala Gly Glu Pro Gly Pro Lys Gly Glu Arg Gly Leu 
90 95 100 

agt gga cct cca gga ctt cca ggt att cct ggt cca get ggg aaa gaa 3 88 
Ser Gly Pro Pro Gly Leu Pro Gly lie Pro Gly Pro Ala Gly Lys Glu 
105 110 115 

ggt ccc tct ggg aag cag ggg aac ata gga cct caa ggc aaa cca ggt 43 6 
Gly Pro Ser Gly Lys Gin Gly Asn He Gly Pro Gin Gly Lys Pro Gly 
120 125 130 135 

cct aaa gga gag get ggg ccc aaa gga gaa gta ggt get cct ggc atg 484 
Pro Lys Gly Glu Ala Gly Pro Lys Gly Glu Val Gly Ala Pro Gly Met 

140 145 150 

caa gga tct aca ggg gca aaa ggc tec aca ggc ccc aag gga gaa aga 532 
Gin Gly Ser Thr Gly Ala Lys Gly Ser Thr Gly Pro Lys Gly Glu Arg 
155 160 165 

ggt gcc cct ggt gtg caa gga gcc cca ggg aat get gga gea gca gga 580 
Gly Ala Pro Gly Val Gin Gly Ala Pro Gly Asn Ala Gly Ala Ala Gly 
170 175 180 

cct gcc gga cct gcc ggt cca cag 'gga get cca ggt tee agg ggg ccc 628 
Pro Ala Gly Pro Ala Gly Pro Gin Gly Ala Pro Gly Ser Arg Gly Pro 
185 190 195 

cca gga etc aag ggg gac aga ggt gtt cct gga gac aga gga ate aaa 67 6 
Pro Gly Leu Lys Gly Asp Arg Gly Val Pro Gly Asp Arg Gly He Lys 
200 205 210 215 

ggt gaa age ggg ctt cca gac agt get get ctg agg cag cag atg gag 724 
Gly Glu Ser Gly Leu Pro Asp Ser Ala Ala Leu Arg Gin Gin Met Glu 

220 225 230 

gee tta aaa gga aaa eta cag cgt eta gag gtt gcc ttc tec cae tat 77 2 
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Ala Leu Lys Gly 
235 

cag aaa get gca 
Gin Lys Ala Ala 
250 

gaa gat gaa agg 
Glu Asp Glu Arg 
265 

cag aga tgc aac 
Gin Arg Cys Asn 
280 

gag att aaa age 
Glu lie Lys Ser 



aaa gag gag acg 
Lys Glu Glu Thr 
315 

cag aat cat caa 
Gin Asn Pro Gin 
330 

aca aca tct gtg 
Thr Thr Ser Val 
345 

aac aac ttg gta 
Asn Asn Leu Val 
360 

caa gga etc tat 
Gin Gly Leu Tyr 

gaa get teg agt 
Glu Ala Ser Ser 
395 

ecc ggt aga ttc 
Pro Gly Arg Phe 
410 

tec gcc aaa cct 



Lys Leu Gin 

ttg ttc cct 
Leu Phe Pro 

aat ctt cat 
Asn Leu His 
270 

aca gga gaa 
Thr Gly Glu 
285 

cag ttt gaa 
Gin Phe Glu 
300 

aag aaa gaa 
Lys Lys Glu 

att gcg gca 
He Ala Ala 

tta cag tgg 
Leu Gin Trp 
350 

aec ctg gaa 
Thr Leu Glu 
365 

tat ate tat 
Tyr He Tyr 
380 

caa get cca 
Gin Ala Pro 



gag aga ate 
Glu Arg He 



tge ggg caa 



Arg Leu Glu Val 
240 

gat ggc cat aga 
Asp Gly His Arg 
255 

gaa gat ttt gta 
Glu Asp Phe Val 



aga tee tta tee 
Arg Ser Leu Ser 
290 

ggc ttt gtg aag 
Gly Phe Val Lys 
305 

aac age ttt gaa 
Asn Ser Phe Glu 
320 

cat gtc ata agt 
His Val He Ser 
335 

get gaa aaa gga 
Ala Glu Lys Gly 



aat ggg aaa cag 
Asn Gly Lys Gin 
370 

gee eaa gtc ace 
Ala Gin Val Thr 
385 

ttt ata gcc age 
Phe He Ala Ser 
400 

tta etc aga get 
Leu Leu Arg Ala 
415 

eaa tec att cac 



Ala Phe Ser 
245 

agg ttg gae 
Arg Leu Asp 
260 

ttc atg aaa 
Phe Met Lys 
275 

tta ctg aac 
Leu Leu Asn 



gat ata atg 
Asp He Met 

atg eaa aaa 
Met Gin Lys 
325 

gag gcc age 
Glu Ala Ser 
340 

tac tac ace 
Tyr Tyr Thr 
355 

ctg ace gtt 
Leu Thr Val 



ttc tgt tec 
Phe Cys Ser 

etc tgc eta 
Leu Cys Leu 
405 

gca aat ace 
Ala Asn Thr 
420 

ttg gga gga 



His Tyr 

aag ata 820 
Lys He 



acg ata 868 
Thr He 



tgt gag 916 
Cys Glu 
295 

tta aac 964 

Leu Asn 

310 

ggt gat 1012 
Gly Asp 

agt aaa 1060 
Ser Lys 

atg age 1108 
Met Ser 



aaa aga 1156 
Lys Arg 
375 

aat egg 1204 

Asn Arg 

390 

aag tec 1252 
Lys Ser 

cae agt 1300 
His Ser 

gta ttt 1348 
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Ser Ala Lys Pro Cys Gly Gin Gin Ser He His Leu Gly Gly Val Phe 
425 430 435 

gaa ttg caa cca ggt get teg gtg ttt gtc aat gtg act gat eca age 1396 
Glu Leu Gin Pro Gly Ala Ser Val Phe Val Asn Val Thr Asp Pro Ser 
440 445 450 455 

eaa gtg age cat ggc act ggc ttc acg tec ttt gge tta etc aaa etc 1444 
Gin Val Ser His Gly Thr Gly Phe Thr Ser Phe Gly Leu Leu Lys Leu 

460 465 470 

tgaacagtgt cacettgcag gctgtggtgg agctgacgct gggagtcttc ataatacagc 1504 

acaggettaa geeeaattat acacteeaag geatgtagaa ctggtaec 1552 



<210> 2 
<211> 471 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (without the CRD) fused to 
the extracellular portion of human CD40L 

<400> 2 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
15 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly He 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn He 
115 120 125 
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Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 



Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 

165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly He Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 230 235 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 

245 250 255 

His Arg Arg Leu Asp Lys He Glu Asp Glu Arg Asn Leu His Glu Asp 
260 265 270 

Phe Val Phe Met Lys Thr He Gin Arg Cys Asn Thr Gly Glu Arg Ser 
275 280 285 

Leu Ser Leu Leu Asn Cys Glu Glu He Lys Ser Gin Phe Glu Gly Phe 
290 295 300 

Val Lys Asp He Met Leu Asn Lys Glu Glu Thr Lys Lys Glu Asn Ser 
305 310 315 320 

Phe Glu Met Gin Lys Gly Asp Gin Asn Pro Gin He Ala Ala His Val 

325 330 335 

He Ser Glu Ala Ser Ser Lys Thr Thr Ser Val Leu Gin Trp Ala Glu 
340 345 350 

Lys Gly Tyr Tyr Thr Met Ser Asn Asn Leu Val Thr Leu Glu Asn Gly 
355 360 365 



Lys Gin Leu Thr Val Lys Arg Gin Gly Leu Tyr Tyr He Tyr Ala Gin 
370 375 380 
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Val Thr Phe Cys 
385 

Ala Ser Leu Cys 



Arg Ala Ala Asn 
420 

lie His Leu Gly 
435 

Val Asn Val Thr 
450 

Ser Phe Gly Leu 
465 



Ser Asn Arg Glu 
390 

Leu Lys Ser Pro 
405 

Thr His Ser Ser 



Gly Val Phe Glu 
440 

Asp Pro Ser Gin 
455 

Leu Lys Leu 
470 



Ala Ser Ser Gin 
395 

Gly Arg Phe Glu 
410 

Ala Lys Pro Cys 
425 

Leu Gin Pro Gly 



Val Ser His Gly 
460 



Ala Pro Phe lie 
400 

Arg lie Leu Leu 
415 

Gly Gin Gin Ser 
430 

Ala Ser Val Phe 
445 

Thr Gly Phe Thr 



<210> 3 
<211> 1574 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> 5'UTR 
<222> (7) . . (31) 

<223> 5' UTR taken from rat sequence for surfactant 
protein D 

<220> 

<221> sig_peptide 
<222> (32) . . (88) 

<223> Signal peptide from murine surfactant protein D 

<220> 

<221> CDS 

<222> (32) . . (1534) 

<220> 

<221> misc_feature 
<222> (32) . . (800) 

<223> Murine surfactant protein D including hub region, 
collagenous portion, and neck, but excluding 
carbohydrate recognition domain (CRD) 
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<220> 

<221> misc_f eature 
<222> (801) . . (1534) 

<223> Murine RANKL/TRANCE extracellular region, 
including stalk 



<300> 

<301> Motwani, M 
White, R A. 
Guo, N 
Dowler, L L. 
Tauber, A I. 
Motwani , M 

<302> Mouse surfactant protein-D. cDNA cloning, 

characterization, and gene localization to chromosome 
14. 

<303> J. Immunol. 
<304> 155 
<305> 12 
<306> 5671-5677 
<307> 1995 
<313> 32 TO 800 



<300> 

<301> Anderson, D M. 

Maraskovsky, E 
Billingsley, W L. 
Dougall, W C. 

<302> A homologue of the TNF receptor and its ligand enhance 

T-cell growth and dendritic-cell function. 
<303> Nature 
<304> 390 
<305> 6556 
<306> 175-179 
<307> 1997 
<313> 801 TO 1534 



<400> 3 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 

Met Leu Pro Phe Leu Ser Met 
1 5 

ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 
Leu Val Leu Leu Val Gin Pro Leu Gly Asn Leu Gly Ala Glu Met Lys 
10 15 20 

age etc teg cag aga tea gta ccc aac ace tgc ace eta gte atg tgt 
Ser Leu Ser Gin Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 



52 



100 



148 
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25 30 35 

age cca aca gag aat ggc ctg cct ggt cgt gat gga egg gat ggg aga 19 6 
Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 
40 45 50 55 

gaa ggt cca egg ggt gag aag ggt gat cca ggt ttg cca gga cct atg 244 
Glu Gly Pro Arg Gly Glu Lys Gly Asp Pro Gly Leu Pro Gly Pro Met 

60 65 70 

ggg etc tea ggg ttg cag ggc cct aca ggt cea gtt gga cec aaa gga 292 
Gly Leu Ser Gly Leu Gin Gly Pro Thr Gly Pro Val Gly Pro Lys Gly 
75 80 85 

gag aat ggc tet get ggc gaa cct gga cca aag gga gaa cgt gga eta 340 
Glu Asn Gly Ser Ala Gly Glu Pro Gly Pro Lys Gly Glu Arg Gly Leu 
90 95 100 

agt gga cct cca gga ett cca ggt att cct ggt cea get ggg aaa gaa 388 
Ser Gly Pro Pro Gly Leu Pro Gly He Pro Gly Pro Ala Gly Lys Glu 
105 110 115 

ggt ccc tct ggg aag cag ggg aac ata gga cct caa ggc aaa cea ggt 436 
Gly Pro Ser Gly Lys Gin Gly Asn He Gly Pro Gin Gly Lys Pro Gly 
120 125 130 135 

cct aaa gga gag get ggg ecc aaa gga gaa gta ggt get cct ggc atg 484 
Pro Lys Gly Glu Ala Gly Pro Lys Gly Glu Val Gly Ala Pro Gly Met 

140 145 150 

caa gga tet aca ggg gea aaa ggc tec aca ggc ecc aag gga gaa aga 532 
Gin Gly Ser Thr Gly Ala Lys Gly Ser Thr Gly Pro Lys Gly Glu Arg 
155 160 165 

ggt gee cct ggt gtg caa gga gee cca ggg aat get gga gea gea gga 580 
Gly Ala Pro Gly Val Gin Gly Ala Pro Gly Asn Ala Gly Ala Ala Gly 
170 175 180 

cct gee gga cct gee ggt cca cag gga get cca ggt tec agg ggg cec 628 
Pro Ala Gly Pro Ala Gly Pro Gin Gly Ala Pro Gly Ser Arg Gly Pro 
185 190 195 

cca gga etc aag ggg gac aga ggt gtt cct gga gae aga gga ate aaa 67 6 
Pro Gly Leu Lys Gly Asp Arg Gly Val Pro Gly Asp Arg Gly He Lys 
200 205 210 215 

ggt gaa age ggg ett cea gac agt get get ctg agg cag cag atg gag 7 24 
Gly Glu Ser Gly Leu Pro Asp Ser Ala Ala Leu Arg Gin Gin Met Glu 
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220 



225 



230 



gcc tta aaa gga aaa eta cag cgt eta gag gtt gee ttc tee cae tat 772 
Ala Leu Lys Gly Lys Leu Gin Arg Leu Glu Val Ala Phe Ser His Tyr 
235 240 245 

eag aaa get gea ttg tte act gat ggt cga gcg cag atg gat cat aae 820 
Gin Lys Ala Ala Leu Phe Pro Asp Gly Arg Ala Gin Met Asp Pro Asn 
250 255 260 

aga ata tea gaa gae age act cae tge ttt tat aga ate ctg aga etc 868 
Arg He Ser Glu Asp Ser Thr His Cys Phe Tyr Arg He Leu Arg Leu 
265 270 275 

cat gaa aae gca ggt ttg eag gac teg act ctg gag agt gaa gae aea 916 
His Glu Asn Ala Gly Leu Gin Asp Ser Thr Leu Glu Ser Glu Asp Thr 
280 285 290 295 

eta cct gac tec tge agg agg atg aaa eaa gee ttt eag ggg gee gtg 9 64 
Leu Pro Asp Ser Cys Arg Arg Met Lys Gin Ala Phe Gin Gly Ala Val 

300 305 310 

eag aag gaa ctg eaa eac att gtg ggg cca eag cge ttc tea gga get 1012 
Gin Lys Glu Leu Gin His He Val Gly Pro Gin Arg Phe Ser Gly Ala 
315 320 325 

cca get atg atg gaa ggc tea tgg ttg gat gtg gcc eag cga ggc aag 1060 
Pro Ala Met Met Glu Gly Ser Trp Leu Asp Val Ala Gin Arg Gly Lys 
330 335 340 

eet gag gcc eag cca ttt gea cae etc aee ate aat get gee 'age ate 1108 
Pro Glu Ala Gin Pro Phe Ala His Leu Thr He Asn Ala Ala Ser He 
345 350 355 

cca teg ggt tee eat aaa gte act ctg tec tet tgg tac cae gat ega 1156 
Pro Ser Gly Ser His Lys Val Thr Leu Ser Ser Trp Tyr His Asp Arg 
360 365 370 375 

ggc tgg gcc aag ate tct aac atg aeg tta age aae gga aaa eta agg 1204 
Gly Trp Ala Lys He Ser Asn Met Thr Leu Ser Asn Gly Lys Leu Arg 

380 385 390 

gtt aac eaa gat ggc ttc tat tac ctg tac gcc aac att tge ttt egg 1252 
Val Asn Gin Asp Gly Phe Tyr Tyr Leu Tyr Ala Asn He Cys Phe Arg 
395 400 405 

cat cat gaa aca teg gga age gta cct aea gac tat ett eag ctg atg 1300 
His His Glu Thr Ser Gly Ser Val Pro Thr Asp Tyr Leu Gin Leu Met 
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410 415 420 

gtg tat gtc gtt aaa acc age ate aaa ate cea agt tet cat aae etg 13 48 
Val Tyr Val Val Lys Thr Ser He Lys He Pro Ser Ser His Asn Leu 
425 430 435 

atg aaa gga ggg age acg aaa aae tgg teg gge aat tct gaa tte cac 13 96 
Met Lys Gly Gly Ser Thr Lys Asn Trp Ser Gly Asn Ser Glu Phe His 
440 445 450 455 

ttt tat tec ata aat gtt ggg gga ttt tte aag etc ega get ggt gaa 1444 
Phe Tyr Ser He Asn Val Gly Gly Phe Phe Lys Leu Arg Ala Gly Glu 

460 465 470 

gaa att age att cag gtg tec aae cct tee etg etg gat ecg gat caa 1492 
Glu He Ser He Gin Val Ser Asn Pro Ser Leu Leu Asp Pro Asp Gin 
475 480 485 

gat gcg acg tac ttt ggg get tte aaa gtt cag gac ata gac 153 4 

Asp Ala Thr Tyr Phe Gly Ala Phe Lys Val Gin Asp He Asp 
490 495 500 



tgagacteat ttegtggaae attageggee getaaaetat 



1574 



<210> 4 
<211> 501 
<212> PRT 

<213> Artificial Sequence 
<400> 4 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
15 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 95 



11 



Pro Lys Gly Glu 
100 

Pro Gly Pro Ala 
115 

Gly Pro Gin Gly 
130 

Glu Val Gly Ala 
145 

Thr Gly Pro Lys 



Gly Asn Ala Gly 
180 

Ala Pro Gly Ser 
195 

Pro Gly Asp Arg 
210 

Ala Leu Arg Gin 
225 

Glu Val Ala Phe 



Arg Ala N31n Met 
260 

Phe Tyr Arg lie 
275 

Thr Leu Glu Ser 

290 

Gin Ala Phe Gin 
305 

Pro Gin Arg Phe 



Asp Val Ala Gin 
340 



Arg Gly Leu Ser 



Gly Lys Glu Gly 
120 

Lys Pro Gly Pro 
135 

Pro Gly Met Gin 
150 

Gly Glu Arg Gly 
155 

Ala Ala Gly Pro 



Arg Gly Pro Pro 
200 

Gly He Lys Gly 
215 

Gin Met Glu Ala 
230 

Ser His Tyr Gin 
245 

Asp Pro Asn Arg 



Leu Arg Leu His 
280 

Glu Asp Thr Leu 
295 

Gly Ala Val Gin 
310 

Ser Gly Ala Pro 
325 

Arg Gly Lys Pro 



Gly Pro Pro Gly 
105 

Pro Ser Gly Lys 



Lys Gly Glu Ala 
140 

Gly Ser Thr Gly 
155 

Ala Pro Gly Val 
170 

Ala Gly Pro Ala 
185 

Gly Leu Lys Gly 



Glu Ser Gly Leu 
220 

Leu Lys Gly Lys 
235 

Lys Ala Ala Leu 
250 

He Ser Glu Asp 
265 

Glu Asn Ala Gly 



Pro Asp Ser Cys 
300 

Lys Glu Leu Gin 
315 

Ala Met Met Glu 
330 

Glu Ala Gin Pro 
345 



Leu Pro Gly He 
110 

Gin Gly Asn He 
125 

Gly Pro Lys Gly 



Ala Lys Gly Ser 
160 

Gin Gly Ala Pro 
175 

Gly Pro Gin Gly 
190 

Asp Arg Gly Val 
205 

Pro Asp Ser Ala 



Leu Gin Arg Leu 
240 

Phe Pro Asp Gly 
255 

Ser Thr His Cys 
270 

Leu Gin Asp Ser 
285 

Arg Arg Met Lys 



His He Val Gly 
320 

Gly Ser Trp Leu 
335 

Phe Ala His Leu 
350 



Thr lie Asn Ala 
355 

Ser Ser Trp Tyr 
370 

Leu Ser Asn Gly 
385 

Tyr Ala Asn He 



Thr Asp Tyr Leu 
420 

He Pro Ser Ser 
435 

Ser Gly Asn Ser 
450 

Phe Lys Leu Arg 
465 

Ser Leu Leu Asp 



Val Gin Asp He 
500 



Ala Ser He Pro 
360 

His Asp Arg Gly 
375 

Lys Leu Arg Val 
390 

Cys Phe Arg His 
405 

Gin Leu Met Val 



His Asn Leu Met 
440 

Glu Phe His Phe 
455 

Ala Gly Glu Glu 
470 

Pro Asp Gin Asp 
485 

Asp 



Ser Gly Ser His 



Trp Ala Lys He 
380 

Asn Gin Asp Gly 
395 

His Glu Thr Ser 
410 

Tyr Val Val Lys 
425 

Lys Gly Gly Ser 



Tyr Ser He Asn 
460 

He Ser He Gin 
475 

Ala Thr Tyr Phe 
490 



Lys Val Thr Leu 
365 

Ser Asn Met Thr 



Phe Tyr Tyr Leu 
400 

Gly Ser Val Pro 
415 

Thr Ser He Lys 
430 

Thr Lys Asn Trp 
445 

Val Gly Gly Phe 



Val Ser Asn Pro 
480 

Gly Ala Phe Lys 
495 



<210> 5 
<211> 1477 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (except CRD) fused to the 
extracellular domain of murine CD40 ligand 

<220> 

<221> 5'UTR 
<222> (7) . . (31) 

<223> 5' UTR from rat surfactant protein D 
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<220> 

<221> sig_peptide 
<222> (32) . . (88) 

<223> Signal peptide from murine surfactant protein D 

<220> 

<221> CDS 

<222> (32) . . (1441) 

<220> 

<221> misc_recomb 
<222> (88) . . (799) 

<223> Mature murine surfactant protein D including hub 
region, collagenous portion, and neck, but 
excluding carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (800) . . (1441) 

<223> Murine CD40 ligand extracellular region, including 
stalk 

<300> 

<301> Motwani, M 
White, R A. 
Guo, N 
Dowler, L L. 
Tauber, A I. 
Motwani, M 

<302> Mouse surfactant protein-D, cDNA cloning, 

characterization, and gene localization to chromosome 
14. 

<303> J. Immunol, 
<304> 155 
<305> 12 
<306> 5571-5677 
<307> 1995 
<313> 32 TO 800 

<300> 

<3 01> Armitage, R 
Fanslow, W 
Sato, T A, 
Clifford, K N. 

<302> Molecular and biological characterization of a murine 

ligand for CD40 
<303> Nature 
<304> 357 



14 



<305> 6373 
<306> 80-82 
<307> 1992 
<313> 801 TO 1441 

<400> 5 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 52 

Met Leu Pro Phe Leu Ser Met 
1 5 

ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 100 
Leu Val Leu Leu Val Gin Pro Leu Gly Asn Leu Gly Ala Glu Met Lys 
10 15 20 

age etc teg eag aga tea gta ccc aac acc tgc ace eta gtc atg tgt 148 
Ser Leu Ser Gin Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 
25 30 35 

age eca aca gag aat ggc ctg eet ggt egt gat gga egg gat ggg aga 19 6 
Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 
40 45 50 55 

gaa ggt eca egg ggt gag aag ggt gat cca ggt ttg cca gga cot atg 244 
Glu Gly Pro Arg Gly Glu Lys Gly Asp Pro Gly Leu Pro Gly Pro Met 

60 65 70 

ggg etc tea ggg ttg eag gge eet aca ggt cca gtt gga cee aaa gga 292 
Gly Leu Ser Gly Leu Gin Gly Pro Thr Gly Pro Val Gly Pro Lys Gly 
75 80 85 

gag aat ggc tct get ggc gaa cct gga cca aag gga gaa cgt gga eta 340 
Glu Asn Gly Ser Ala Gly Glu Pro Gly Pro Lys Gly Glu Arg Gly Leu 
90 95 100 

gga eet cca gga ctt eca ggt att eet ggt cca get ggg aaa gaa 3 88 
Ser Gly Pro Pro Gly Leu Pro Gly lie Pro Gly Pro Ala Gly Lys Glu 
105 110 115 

ggt cee tct ggg aag cag ggg aac ata gga cct caa gge aaa eca ggt 43 6 
Gly Pro Ser Gly Lys Gin Gly Asn lie Gly Pro Gin Gly Lys Pro Gly 
120 125 130 135 

cct aaa gga gag get ggg ccc aaa gga gaa gta ggt get cct ggc atg 484 
Pro Lys Gly Glu Ala Gly Pro Lys Gly Glu Val Gly Ala Pro Gly Met 

140 145 150 

caa gga tct aca ggg gca aaa ggc tec aca ggc ccc aag gga gaa aga 532 
Gin Gly Ser Thr Gly Ala Lys Gly Ser Thr Gly Pro Lys Gly Glu Arg 
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155 



160 



165 



ggt gcc cct 
Gly Ala Pro 
170 

cct gcc gga 
Pro Ala Gly 
185 

cca gga etc 
Pro Gly Leu 
200 

ggt gaa age 
Gly Glu Ser 

gcc tta aaa 
Ala Leu Lys 

cag aaa get 
Gin Lys Ala 
250 

gaa gag gaa 
Glu Glu Glu 
265 

aag aga tgc 
Lys Arg Cys 
280 

gag atg aga 
Glu Met Arg 



aaa gaa gag 
Lys Glu Glu 

gat cct caa 
Asp Pro Gin 
330 

gca tec gtt 
Ala Ser Val 



ggt gtg eaa gga 
Gly Val Gin Gly 



cct gcc ggt cca 
Pro Ala Gly Pro 
190 

aga 

Lys Gly Asp Arg 
205 

ggg ctt cca gae 
Gly Leu Pro Asp 
220 

gga aaa eta cag 
Gly Lys Leu Gin 
235 

gca ttg ttc cct 
Ala Leu Phe Pro 



gta aae ctt eat 
Val Asn Leu His 
270 

aac aaa gga gaa 
Asn Lys Gly Glu 
285 

agg caa ttt gaa 
Arg Gin Phe Glu 
300 

aaa aaa gaa aac 
Lys Lys Glu Asn 
315 

att gca gca eac 
He Ala Ala His 



eta cag tgg gcc 
Leu Gin Trp Ala 



gee cca ggg aat 
Ala Pro Gly Asn 
175 

cag gga get cca 
Gin Gly Ala Pro 



ggt gtt cct gga 
Gly Val Pro Gly 
210 

agt get get etg 
Ser Ala Ala Leu 
225 

cgt eta gag gtt 
Arg Leu Glu Val 
240 

gat gge eat aga 
Asp Gly His Arg 
255 

gaa gat ttt gta 
Glu Asp Phe Val 

gga tot tta tec 
Gly Ser Leu Ser 
290 

gae ctt gte aag 
Asp Leu Val Lys 
305 

age ttt gaa atg 
Ser Phe Glu Met 
320 

gtt gta age gaa 
Val Val Ser Glu 
335 

aag aaa gga tat 
Lys Lys Gly Tyr 



get gga gca 
Ala Gly Ala 
180 

ggt tec agg 
Gly Ser Arg 
195 

gae aga gga 
Asp Arg Gly 



agg cag cag 
Arg Gin Gin 



gee ttc tee 
Ala Phe Ser 
245 

aga ttg gat 
Arg Leu Asp 
260 

ttc ata aaa 
Phe He Lys 
275 

ttg ctg aac 
Leu Leu Asn 

gat ata acg 
Asp lie Thr 



caa aga ggt 
Gin Arg Gly 
325 

gee aac agt 
Ala Asn Ser 
340 

tat acc atg 
Tyr Thr Met 



gca gga 580 
Ala Gly 



ggg ccc 62 8 
Gly Pro 

ate aaa 67 6 
lie Lys 
215 

atg gag 72 4 

Met Glu 

230 

eac tat 7 72 
His Tyr 

aag gte 82 0 
Lys Val 

aag eta 868 
Lys Leu 

tgt gag 916 
Cys Glu 
295 

tta aae 964 
Leu Asn 
310 

gat gag 1012 
Asp Glu 

aat gca 1060 
Asn Ala 

aaa age 1108 
Lys Ser 
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345 350 355 

aac ttg gta atg ctt gaa aat ggg aaa cag ctg acg gtt aaa aga gaa 1156 
Asn Leu Val Met Leu Glu Asn Gly Lys Gin Leu Thr Val Lys Arg Glu 
360 365 370 375 

gga etc tat tat gtc tac act caa gtc acc ttc tgc tct aat egg gag 1204 
Gly Leu Tyr Tyr Val Tyr Thr Gin Val Thr Phe Cys Ser Asn Arg Glu 

380 385 390 

cct teg agt eaa cge eca ttc ate gte ggc etc tgg ctg aag ecc age 1252 
Pro Ser Ser Gin Arg Pro Phe lie Val Gly Leu Trp Leu Lys Pro Ser 
395 400 405 

att gga tet gag aga ate tta etc aag geg gea aat acc eac agt tee 1300 
lie Gly Ser Glu Arg lie Leu Leu Lys Ala Ala Asn Thr His Ser Ser 
410 415 420 

tec cag ctt tgc gag cag cag tct gtt cac ttg ggc gga gtg ttt gaa 1348 
Ser Gin Leu Cys Glu Gin Gin Ser Val His Leu Gly Gly Val Phe Glu 
425 430 435 

tta eaa get ggt get tet gtg ttt gte aac gtg act gaa gea age caa 13 9 6 
Leu Gin Ala Gly Ala Ser Val Phe Val Asn Val Thr Glu Ala Ser Gin 
440 445 450 455 

gtg ate cac aga gtt ggc ttc tea tct ttt ggc tta etc aaa etc 1441 
Val lie His Arg Val Gly Phe Ser Ser Phe Gly Leu Leu Lys Leu 

460 465 470 

tgaacagtge gctgtectag getgeagcag ggtace 1477 



<210> 6 
<211> 470 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (except CRD) fused to the 
extracellular domain of murine CD40 ligand 

<400> 6 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
15 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 
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Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 



Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly lie 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn lie 
115 120 125 

Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 

165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 ' 200 205 

Pro Gly Asp Arg Gly lie Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 230 235 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 

245 250 255 

His Arg Arg Leu Asp Lys Val Glu Glu Glu Val Asn Leu His Glu Asp 
260 265 270 



Phe Val Phe lie Lys Lys Leu Lys Arg Cys Asn Lys Gly Glu Gly Ser 
275 280 285 
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Leu Ser Leu Leu Asn Cys Glu Glu Met Arg Arg Gin Phe Glu Asp Leu 
290 295 300 



Val Lys Asp lie 
305 

Glu Met Gin Arg 



Ser Glu Ala Asn 
340 

Gly Tyr Tyr Thr 
355 

Gin Leu Thr Val 
370 

Thr Phe Cys Ser 
385 

Gly Leu Trp Leu 



Ala Ala Asn Thr 
420 

His Leu Gly Gly 
435 

Asn Val Thr Glu 
450 

Phe Gly Leu Leu 
465 



Thr Leu Asn Lys 
310 

Gly Asp Glu Asp 
325 

Ser Asn Ala Ala 



Met Lys Ser Asn 
360 

Lys Arg Glu Gly 
375 

Asn Arg Glu Pro 
390 

Lys Pro Ser lie 
405 

His Ser Ser Ser 



Val Phe Glu Leu 
440 

Ala Ser Gin Val 
455 

Lys Leu 
47 0 



Glu Glu Lys Lys 
315 

Pro Gin lie Ala 
330 

Ser Val Leu Gin 
345 

Leu Val Met Leu 



Leu Tyr Tyr Val 
380 

Ser Ser Gin Arg 
395 

Gly Ser Glu Arg 
410 

Gin Leu Cys Glu 
425 

Gin Ala Gly Ala 



He His Arg Val 
460 



Glu Asn Ser Phe 
320 

Ala His Val Val 
335 

Trp Ala Lys Lys 
350 

Glu Asn Gly Lys 
365 

Tyr Thr Gin Val 



Pro Phe He Val 
400 

He Leu Leu Lys 
415 

Gin Gin Ser Val 
430 

Ser Val Phe Val 
445 

Gly Phe Ser Ser 
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SEQUENCE LISTING 




TRj^n 0> Kombluth, Richard S 

<120> Multimeric forms of CD40L and other TNF family members 

<130> TNFSF-colIectin fusion proteins 

<140> 60/111,471 
<141> 1998-12-08 

<160>6 

<170>PatentInVer. 2.1 

<210> 1 

<211>1552 

<212>DNA 

<213> Artificial Sequence 
<220> 

<221> 5'UTR 
<222> (7)..(31) 

<220> 

<221> misc_feature 
<222> (88)..(799) 

<223> Mature murine surfactant protein D including hub 
region, collagenous portion, and neck, but 
excluding carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (801)..(1546) 

<223> Human CD40 ligand extracellular region, including 



<220> 

<221> sig_peptide 
<222> (32)..(88) 

<223> Signal peptide from murine surfactant protein D 

<220> 
<221> CDS 
<222> (32)..(1444) 

<220> 

<223> Description of Artificial Sequence: Murine 
surfactant protein D (without the CRD) fused to 
the extracellular portion of human CD40L 

<300> 

<301> Spriggs, Melanie K. 
Armitage, Richard J. 



stalk. 



Strockbine, L 

Clifford, KN, 

Macduff, B M. 

Sato, T A. 

Maliszewski, C R. 

Fanslow, William C. 
<302> Recombinant human CD40 ligand stimulates B cell 

proliferation and immunoglobulin E secretion. 
<303> J. Exp. Med. 
<304> 176 
<305> 6 

<306> 1543-1550 
<307> 1992 
<313> 801 TO 1600 

<300> 

<301>Motwani, M 
White, R A. 

<302> Mouse surfactant protein-D. cDNA cloning, 
characterization, and gene localization to chromosome 
14. 

<303> J. Immunol. 
<304> 155 
<305> 12 
<306> 5671-5677 
<307> 1995 
<313> 32 TO 800 

<400> 1 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 52 

Met Leu Pro Phe Leu Ser Met 
1 5 

ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 100 
Leu Val Leu Leu Val Gin Pro Leu Gly Asn Leu Gly Ala Glu Met Lys 
10 15 20 

age etc teg cag aga tea gta ccc aac ace tgc acc eta gtc atg tgt 148 
Ser Leu Ser Ghi Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 
25 30 35 

age cca aca gag aat ggc ctg cct ggt cgt gat gga egg gat ggg aga 196 
Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 
40 45 50 55 

gaa ggt cca egg ggt gag aag ggt gat cca ggt ttg cca gga cct atg 244 
Glu Gly Pro Arg Gly Glu Lys Gly Asp Pro Gly Leu Pro Gly Pro Met 
60 65 70 

ggg etc tea ggg ttg cag ggc cct aca ggt cca gtt gga ccc aaa gga 292 
Gly Leu Ser Gly Leu Ghi Gly Pro Thr Gly Pro Val Gly Pro Lys Gly 
75 80 85 



gag aat ggc tct get ggc gaa cct gga cca aag gga gaa cgt gga eta 340 



Glu Asn Gly Ser Ala Gly Glu Pro Gly Pro Lys Gly Glu Arg Gly Leu 
90 95 100 



agt gga cct cca gga ctt cca ggt att cct ggt cca get ggg aaa gaa 388 
Ser Gly Pro Pro Gly Leu Pro Gly He Pro Gly Pro Ala Gly Lys Glu 
105 110 115 

ggt ccc tct ggg aag cag ggg aac ata gga cct caa ggc aaa cca ggt 436 
Gly Pro Ser Gly Lys Gin Gly Asn lie Gly Pro Gin Gly Lys Pro Gly 
120 125 130 135 

cct aaa gga gag get ggg ccc aaa gga gaa gta ggt get cct ggc atg 484 
Pro Lys Gly Glu Ala Gly Pro Lys Gly Glu Val Gly Ala Pro Gly Met 
140 145 150 

caa gga tct aca ggg gea aaa ggc tec aca ggc ccc aag gga gaa aga 532 
Gin Gly Ser Thr Gly Ala Lys Gly Ser Thr Gly Pro Lys Gly Glu Arg 
155 160 165 

ggt gee cct ggt gtg caa gga gee cca ggg aat get gga gea gea gga 580 
Gly Ala Pro Gly Val Gin Gly Ala Pro Gly Asn Ala Gly Ala Ala Gly 
170 175 180 

cct gee gga cct gee ggt cca cag gga get cca ggt tec agg ggg ccc 628 
Pro Ala Gly Pro Ala Gly Pro Gin Gly Ala Pro Gly Ser Arg Gly Pro 
185 190 195 

cca gga etc aag ggg gac aga ggt gtt cct gga gac aga gga ate aaa 676 
Pro Gly Leu Lys Gly Asp Arg Gly Val Pro Gly Asp Arg Gly He Lys 
200 205 210 215 

ggt gaa age ggg ctt cca gac agt get get ctg agg cag cag atg gag 724 
Gly Glu Ser Gly Leu Pro Asp Ser Ala Ala Leu Arg Gin Gin Met Glu 
220 225 230 

gee tta aaa gga aaa eta eag cgt eta gag gtt gee ttc tec cae tat 772 
Ala Leu Lys Gly Lys Leu Gin Arg Leu Glu Val Ala Phe Ser His Tyr 
235 240 245 

cag aaa get gea ttg ttc cct gat ggc cat aga agg ttg gac aag ata 820 
Gin Lys Ala Ala Leu Phe Pro Asp Gly His Arg Arg Leu Asp Lys He 
250 255 260 

gaa gat gaa agg aat ctt cat gaa gat ttt gta ttc atg aaa acg ata 868 
Glu Asp Glu Arg Asn Leu His Glu Asp Phe Val Phe Met Lys Thr lie 
265 270 275 

eag aga tgc aac aca gga gaa aga tec tta tec tta ctg aac tgt gag 916 
Gin Arg Cys Asn Thr Gly Glu Arg Ser Leu Ser Leu Leu Asn Cys Glu 
280 285 290 295 

gag att aaa age cag ttt gaa ggc ttt gtg aag gat ata atg tta aac 964 
Glu He Lys Ser Ghi Phe Glu Gly Phe Val Lys Asp He Met Leu Asn 
300 305 310 



aaa gag gag acg aag aaa gaa aac age ttt gaa atg caa aaa ggt gat 1012 
Lys Glu Glu Thr Lys Lys Glu Asn Ser Phe Glu Met Gin Lys Gly Asp 
315 320 325 

cag aat cct caa att geg gca cat gtc ata agt gag gcc age agt aaa 1060 
Gin Asn Pro Gin lie Ala Ala His Val He Ser Glu Ala Ser Ser Lys 
330 335 340 

aca aca tct gtg tta cag tgg get gaa aaa gga tac tac acc atg age 1 108 
Thr Thr Ser Val Leu Gin Trp Ala Glu Lys Gly Tyr Tyr Thr Met Ser 
345 350 355 

aac aac ttg gta acc ctg gaa aat ggg aaa cag ctg acc gtt aaa aga 1 156 
Asn Asn Leu Val Thr Leu Glu Asn Gly Lys Gin Leu Thr Val Lys Arg 
360 365 370 375 

caa gga etc tat tat ate tat gcc caa gtc acc ttc tgt tec aat egg 1204 
Gin Gly Leu Tyr Tyr lie Tyr Ala Gin Val Thr Phe Cys Ser Asn Arg 
380 385 390 

gaa get teg agt caa get cca ttt ata gcc age etc tgc eta aag tec 1252 
Glu Ala Ser Ser Ghi Ala Pro Phe He Ala Ser Leu Cys Leu Lys Ser 
395 400 405 

ccc ggt aga ttc gag aga ate tta etc aga get gca aat acc cac agt 1 300 
Pro Gly Arg Phe Glu Arg He Leu Leu Arg Ala Ala Asn Thr His Ser 
410 415 420 

tec gcc aaa cct tgc ggg caa caa tec att cac ttg gga gga gta ttt 1348 
Ser Ala Lys Pro Cys Gly Ghi Ghi Ser He His Leu Gly Gly Val Phe 
425 430 435 

gaa ttg caa cca ggt get teg gtg ttt gtc aat gtg act gat cca age 1396 
Glu Leu Gin Pro Gly Ala Ser Val Phe Val Asn Val Thr Asp Pro Ser 
440 445 450 455 

caa gtg age cat ggc act ggc ttc acg tec ttt ggc tta etc aaa etc 1444 
Ghi Val Ser His Gly Thr Gly Phe Thr Ser Phe Gly Leu Leu Lys Leu 
460 465 470 

tgaacagtgt caccttgcag gctgtggtgg agctgacgct gggagtctte ataatacage 1504 

acaggcttaa gcccaattat acactecaag gcatgtagaa etggtacc 1552 



<210>2 

<211>471 

<212>PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Muruie 
surfactant protein D (without the CRD) fused to 
the extracellular portion of human CD40L 



<400> 2 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
1 5 10 15 

Asn Leu Gly Ala GIu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 
85 90 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly He 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn He 
115 120 125 

Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Ghi Gly Ala Pro 
165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Ghi Gly 
180 185 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly He Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Ghi Ghi Met Glu Ala Leu Lys Gly Lys Leu Ghi Arg Leu 
225 230 235 240 

Glu Val Ala Phe Ser His Tyr Ghi Lys Ala Ala Leu Phe Pro Asp Gly 
245 250 255 

His Arg Arg Leu Asp Lys He Glu Asp Glu Arg Asn Leu His Glu Asp 
260 265 270 

Phe Val Phe Met Lys Thr He Ghi Arg Cys Asn Thr Gly Glu Arg Ser 
275 280 285 



Leu Ser Leu Leu Asn Cys Glu Glu He Lys Ser Gin Phe Glu Gly Phe 
290 295 300 

Val Lys Asp He Met Leu Asn Lys Glu Glu Thr Lys Lys Glu Asn Ser 
305 310 315 320 

Phe Glu Met Gin Lys Gly Asp Gin Asn Pro Gin lie Ala Ala His Val 
325 330 335 

He Ser Glu Ala Ser Ser Lys Thr Thr Ser Val Leu Gbi Trp Ala Glu 
340 345 350 

Lys Gly Tyr Tyr Thr Met Ser Asn Asn Leu Val Thr Leu Glu Asn Gly 
355 360 365 

Lys Ghi Leu Thr Val Lys Arg Ghi Gly Leu Tyr Tyr lie Tyr Ala Gb 
370 375 380 

Val Thr Phe Cys Ser Asn Arg Glu Ala Ser Ser Gin Ala Pro Phe lie 
385 390 395 400 

Ala Ser Leu Cys Leu Lys Ser Pro Gly Arg Phe Glu Arg lie Leu Leu 
405 410 415 

Arg Ala Ala Asn Thr His Ser Ser Ala Lys Pro Cys Gly Ghi Ghi Ser 
420 425 430 

He His Leu Gly Gly Val Phe Glu Leu Ghi Pro Gly Ala Ser Val Phe 
435 440 445 

Val Asn Val Thr Asp Pro Ser Ghi Val Ser His Gly Thr Gly Phe Thr 
450 455 460 

Ser Phe Gly Leu Leu Lys Leu 
465 470 



<210>3 
<211>1574 ^ 
<212>DNA 

<213> Artificial Sequence 
<220> 

<221>5'UTR 
<222> (7).. (31) 

<223> 5' UTR taken from rat sequence for surfactant 
protein D 

<220> 

<221>sig_peptide 
<222> (32)..(88) 

<223> Signal peptide from murme surfactant protein D 



<220> 
<221>CDS 
<222> (32)..(1534) 

<220> 

<221> misc_feature 
<222> (32)..(800) 

<223> Murine surfactant protein D including hub region, 
collagenous portion, and neck, but excluding 
carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (801)..(1534) 

<223> Murine RANKL/TRANCE extracellular region, 
including stalk 

<300> 

<301>Motwani, M 
White, R A. 
Guo,N 
Dowler, L L. 
Tauber, A L 
Motwani, M 

<302> Mouse surfactant protein-D, cDNA cloning, 
characterization, and gene localization to chromosome 
14, 

<303> J. Immunol. 
<304> 155 
<305> 12 
<306> 5671-5677 
<307> 1995 
<313> 32 TO 800 

<300> 

<301> Anderson, D M. 
Maraskovsky, E 
Billingsley, W L. 
Dougall,WC. 

<302> A homologue of the TNfF receptor and its ligand enhance 

T-cell growth and dendritic-cell function. 
<303> Nature 
<304> 390 
<305> 6656 
<306> 175-179 
<307> 1997 
<313> 801 TO 1534 

<400> 3 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 52 

Met Leu Pro Phe Leu Ser Met 
1 5 



ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 100 
Leu Val Leu Leu Val Gin Pro Leu Gly Asn Leu Gly Ala Glu Met Lys 
10 15 20 

age etc teg cag aga tea gta ccc aac acc tgc acc eta gtc atg tgt 148 
Ser Leu Ser Gin Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 
25 30 35 

age cca aca gag aat ggc ctg cct ggt cgt gat gga egg gat ggg aga 1 96 
Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 
40 45 50 55 

gaa ggt cca egg ggt gag aag ggt gat cca ggt ttg cca gga cct atg 244 
Glu Gly Pro Arg Gly Glu Lys Gly Asp Pro Gly Leu Pro Gly Pro Met 
60 65 70 

ggg etc tea ggg ttg cag ggc cct aca ggt cca gtt gga ccc aaa gga 292 
Gly Leu Ser Gly Leu Gin Gly Pro Thr Gly Pro Val Gly Pro Lys Gly 
75 80 85 

gag aat ggc tct get ggc gaa cct gga cca aag gga gaa cgt gga eta 340 
Glu Asn Gly Ser Ala Gly Glu Pro Gly Pro Lys Gly Glu Arg Gly Leu 
90 95 100 

agt gga cct cca gga ctt cca ggt att cct ggt cca get ggg aaa gaa 388 
Ser Gly Pro Pro Gly Leu Pro Gly He Pro Gly Pro Ala Gly Lys Glu 
105 110 115 

ggt ccc tct ggg aag cag ggg aac ata gga cct caa ggc aaa cca ggt 436 
Gly Pro Ser Gly Lys Gin Gly Asn lie Gly Pro Gin Gly Lys Pro Gly 
120 125 130 135 

cct aaa gga gag get ggg ccc aaa gga gaa gta ggt get cct ggc atg 484 
Pro Lys Gly Glu Ala Gly Pro Lys Gly Glu Val Gly Ala Pro Gly Met 
140 145 150 

caa gga tct aca ggg gca aaa ggc tec aca ggc ccc aag gga gaa aga 532 
Gin Gly Ser Thr Gly Ala Lys Gly Ser Thr Gly Pro Lys Gly Glu Arg 
155 160 165 

ggt gcc cct ggt gtg caa gga gcc cca ggg aat get gga gca gca gga 580 
Gly Ala Pro Gly Val Ghi Gly Ala Pro Gly Asn Ala Gly Ala Ala Gly 
170 175 180 

cct gcc gga cct gcc ggt cca cag gga get cca ggt tec agg ggg ccc 628 
Pro Ala Gly Pro Ala Gly Pro Ghi Gly Ala Pro Gly Ser Arg Gly Pro 
185 190 195 

cca gga etc aag ggg gac aga ggt gtt cct gga gac aga gga ate aaa 676 
Pro Gly Leu Lys Gly Asp Arg Gly Val Pro Gly Asp Arg Gly He Lys 
200 205 210 215 

ggt gaa age ggg ctt cca gac agt get get ctg agg cag cag atg gag 724 
Gly Glu Ser Gly Leu Pro Asp Ser Ala Ala Leu Arg Gk Ghi Met Glu 



220 225 230 



gcc tta aaa gga aaa eta cag cgt eta gag gtt gee ttc tec cac tat 772 
Ala Leu Lys Gly Lys Leu Gin Arg Leu Glu Val Ala Phe Ser His Tyr 
235 240 245 

cag aaa get gca ttg ttc cet gat ggt cga gcg cag atg gat cet aac 820 
Gin Lys Ala Ala Leu Phe Pro Asp Gly Arg Ala Gin Met Asp Pro Asn 
250 255 260 

aga ata tea gaa gae age act cac tgc ttt tat aga ate ctg aga etc 868 
Arg lie Ser Glu Asp Ser Thr His Cys Phe Tyr Arg He Leu Arg Leu 
265 270 275 

cat gaa aac gca ggt ttg cag gae teg act ctg gag agt gaa gae aea 916 
His Glu Asn Ala Gly Leu Ghi Asp Ser Thr Leu Glu Ser Glu Asp Thr 
280 285 290 295 

eta cet gae tec tgc agg agg atg aaa caa gcc ttt cag ggg gee gtg 964 
Leu Pro Asp Ser Cys Arg Arg Met Lys Ghi Ala Phe Gin Gly Ala Val 
300 305 310 

cag aag gaa ctg caa cac att gtg ggg cca cag egc ttc tea gga get 1012 
Ghi Lys Glu Leu Ghi His lie Val Gly Pro Ghi Arg Phe Ser Gly Ala 
315 320 325 

eca get atg atg gaa gge tea tgg ttg gat gtg gcc cag cga ggc aag 1060 
Pro Ala Met Met Glu Gly Ser Trp Leu Asp Val Ala Gb Arg Gly Lys 
330 335 340 

cet gag gcc cag cca ttt gca cac etc ace ate aat get gcc age ate 11 08 
Pro Glu Ala Ghi Pro Phe Ala His Leu Thr He Asn Ala Ala Ser lie 
345 350 355 

eca teg ggt tec eat aaa gtc act ctg tec tct tgg tac cac gat cga 1 1 56 
Pro Ser Gly Ser His Lys Val Thr Leu Ser Ser Trp Tyr His Asp Arg 
360 365 370 375 

gge tgg gcc aag ate tct aac atg acg tta age aac gga aaa eta agg 1204 
Gly Trp Ala Lys He Ser Asn Met Thr Leu Ser Asn Gly Lys Leu Arg 
380 385 390 

gtt aac caa gat ggc ttc tat tac ctg tac gcc aac att tgc ttt egg 1252 
Val Asn Gin Asp Gly Phe Tyr Tyr Leu Tyr Ala Asn lie Cys Phe Arg 
395 400 405 

eat cat gaa aea teg gga age gta cet aea gae tat ctt cag ctg atg 1300 
His His Glu Thr Ser Gly Ser Val Pro Thr Asp Tyr Leu Ghi Leu Met 
410 415 420 

gtg tat gtc gtt aaa ace age ate aaa ate cca agt tct cat aac ctg 1348 
Val Tyr Val Val Lys Thr Ser He Lys He Pro Ser Ser His Asn Leu 
425 430 435 



atg aaa gga ggg age acg aaa aac tgg teg ggc aat tct gaa ttc eae 1396 
Met Lys Gly Gly Ser Thr Lys Asn Trp Ser Gly Asn Ser Glu Phe His 
440 445 450 455 

ttt tat tec ata aat gtt ggg gga ttt ttc aag etc ega get ggt gaa 1444 
Phe Tyr Ser lie Asn Val Gly Gly Phe Phe Lys Leu Arg Ala Gly Glu 
460 465 470 

gaa att age att cag gtg tec aac cct tec ctg ctg gat ceg gat caa 1492 
Glu He Ser He Ghi Val Ser Asn Pro Ser Leu Leu Asp Pro Asp Ghi 
475 480 485 

gat gcg acg tac ttt ggg get ttc aaa gtt cag gac ata gac 1534 
Asp Ala Thr Tyr Phe Gly Ala Phe Lys Val Gin Asp He Asp 
490 495 500 

tgagactcat ttcgtggaac attagcggcc gctaaactat 1 574 



<210>4 

<211>501 

<212>PRT 

<213> Artificial Sequence 
<400> 4 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Ghi Pro Leu Gly 
1 5 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Ghi Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Ghi Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 
85 90 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly He 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Ghi Gly Asn He 
115 120 125 

Gly Pro Ghi Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 



Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 
165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly He Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 230 235 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 
245 250 255 

Arg Ala Gin Met Asp Pro Asn Arg He Ser Glu Asp Ser Thr His Cys 
260 265 270 

Phe Tyr Arg He Leu Arg Leu His Glu Asn Ala Gly Leu Gin Asp Ser 
275 280 285 

Thr Leu Glu Ser Glu Asp Thr Leu Pro Asp Ser Cys Arg Arg Met Lys 
290 295 300 

Ghi Ala Phe Gin Gly Ala Val Ghi Lys Glu Leu Ghi His He Val Gly 
305 310 315 320 

Pro Ghi Arg Phe Ser Gly Ala Pro Ala Met Met Glu Gly Ser Trp Leu 
325 330 335 

Asp Val Ala Ghi Arg Gly Lys Pro Glu Ala Ghi Pro Phe Ala His Leu 
340 345 350 

Thr He Asn Ala Ala Ser He Pro Ser Gly Ser His Lys Val Thr Leu 
355 360 365 

Ser Ser Trp Tyr His Asp Arg Gly Trp Ala Lys He Ser Asn Met Thr 
370 375 380 

Leu Ser Asn Gly Lys Leu Arg Val Asn Gin Asp Gly Phe Tyr Tyr Leu 
385 390 395 400 

Tyr Ala Asn He Cys Phe Arg His His Glu Thr Ser Gly Ser Val Pro 
405 410 415 

Thr Asp Tyr Leu Gin Leu Met Val Tyr Val Val Lys Thr Ser He Lys 
420 425 430 

He Pro Ser Ser His Asn Leu Met Lys Gly Gly Ser Thr Lys Asn Tip 
435 440 445 



Ser Gly Asn Ser Glu Phe His Phe Tyr Ser He Asn Val Gly Gly Phe 
450 455 460 

Phe Lys Leu Arg Ala Gly Glu Glu He Ser He Gin Val Ser Asn Pro 
465 470 475 480 

Ser Leu Leu Asp Pro Asp Gin Asp Ala Thr Tyr Phe Gly Ala Phe Lys 
485 490 495 

Val Gin Asp He Asp 
500 



<210> 5 
<211> 1477 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Murine 
surfactant protein D (except CRD) fused to the 
extracellular domain of murine CD40 ligand 

<220> 

<221> 5'UTR 
<222> (7)..(31) 

<223> 5' UTR from rat surfactant protein D 

<220> 

<221>sig_j)eptide 
<222> (32)..(88) 

<223> Signal peptide from murine surfactant protein D 

<220> 
<221>CDS 
<222> (32)..(1441) 

<220> 

<221> misc_recomb 
<222> (88),.(799) 

<223> Mature murine surfactant protein D including hub 
region, collagenous portion, and neck, but 
excluding carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (800)..(1441) 

<223> Murine CD40 ligand extracellular region, including 
stalk 

<300> 

<301>Motwani,M 



White, R A. 
Guo,N 
Dowler, L L. 
Tauber, A 1. 
Motwani, M 

<302> Mouse surfactant protein-D, cDNA cloning, 
characterization, and gene localization to chromosome 
14. 

<303> J. Immunol. 
<304> 155 
<305> 12 
<306> 5671-5677 
<307> 1995 
<313>32TO 800 

<300> 

<301>Armitage,R 
Fanslow, W 
Sato, T A. 
Clifford, KN. 

<302> Molecular and biological characterization of a murine 

ligand for CD40 
<303> Nature 
<304> 357 
<305> 6373 
<306> 80-82 
<307> 1992 
<313> 801 TO 1441 

<400> 5 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 52 

Met Leu Pro Phe Leu Ser Met 
1 5 

ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 100 
Leu Val Leu Leu Val Ghi Pro Leu Gly Asn Leu Gly Ala Glu Met Lys 
10 15 20 

age etc teg cag aga tea gta ccc aac acc tgc ace eta gtc atg tgt 148 
Ser Leu Ser Ghi Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 
25 30 35 

age cca aca gag aat ggc ctg cct ggt cgt gat gga egg gat ggg aga 196 
Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 
40 45 50 55 

gaa ggt cca egg ggt gag aag ggt gat cca ggt ttg cca gga cct atg 244 
Glu Gly Pro Arg Gly Glu Lys Gly Asp Pro Gly Leu Pro Gly Pro Met 
60 65 70 

ggg etc tea ggg ttg cag ggc cct aca ggt cca gtt gga ccc aaa gga 292 
Gly Leu Ser Gly Leu Gin Gly Pro Thr Gly Pro Val Gly Pro Lys Gly 
75 80 85 



gag aat ggc tct get ggc gaa cct gga cca aag gga gaa cgt gga eta 340 
Glu Asn Gly Ser Ala Gly Glu Pro Gly Pro Lys Gly Glu Arg Gly Leu 
90 95 100 

agt gga cct cca gga ctt cca ggt att cct ggt cca get ggg aaa gaa 388 
Ser Gly Pro Pro Gly Leu Pro Gly lie Pro Gly Pro Ala Gly Lys Glu 
105 110 115 

ggt ccc tct ggg aag cag ggg aac ata gga cct caa ggc aaa cca ggt 436 
Gly Pro Ser Gly Lys Gin Gly Asn He Gly Pro Gki Gly Lys Pro Gly 
120 125 130 135 

cct aaa gga gag get ggg ccc aaa gga gaa gta ggt get cct ggc atg 484 
Pro Lys Giy Glu Ala Gly Pro Lys Gly Glu Val Gly Ala Pro Gly Met 
140 145 150 

caa gga tct aca ggg gca aaa ggc tec aca ggc ccc aag gga gaa aga 532 
Gin Gly Ser Thr Gly Ala Lys Gly Ser Thr Gly Pro Lys Gly Glu Arg 
155 160 165 

ggt gee cct ggt gtg caa gga gee cca ggg aat get gga gca gca gga 580 
Gly Ala Pro Gly Val Gin Gly Ala Pro Gly Asn Ala Gly Ala Ala Gly 
170 175 180 

cct gee gga cct gcc ggt cca cag gga get cca ggt tec agg ggg ccc 628 
Pro Ala Gly Pro Ala Gly Pro Gin Gly Ala Pro Gly Ser Arg Gly Pro 
185 190 195 

cca gga etc aag ggg gac aga ggt gtt cct gga gac aga gga ate aaa 676 
Pro Gly Leu Lys Gly Asp Arg Gly Val Pro Gly Asp Arg Gly He Lys 
200 205 210 215 

ggt gaa age ggg ctt cca gac agt get get ctg agg cag cag atg gag 724 
Gly Glu Ser Gly Leu Pro Asp Ser Ala Ala Leu Arg Gin Gin Met Glu 
220 225 230 

gcc tta aaa gga aaa eta cag cgt eta gag gtt gee ttc tec cac tat 772 
Ala Leu Lys Gly Lys Leu Gin Arg Leu Glu Val Ala Phe Ser His Tyr 
235 240 245 

cag aaa get gca ttg ttc cct gat ggc cat aga aga ttg gat aag gtc 820 
Gin Lys Ala Ala Leu Phe Pro Asp Gly His Arg Arg Leu Asp Lys Val 
250 255 260 

gaa gag gaa gta aac ctt cat gaa gat ttt gta ttc ata aaa aag eta 868 
Glu Glu Glu Val Asn Leu His Glu Asp Phe Val Phe lie Lys Lys Leu 
265 270 275 

aag aga tge aac aaa gga gaa gga tct tta tec ttg ctg aac tgt gag 916 
Lys Arg Cys Asn Lys Gly Glu Gly Ser Leu Ser Leu Leu Asn Cys Glu 
280 285 290 295 

gag atg aga agg caa ttt gaa gac ctt gtc aag gat ata acg tta aac 964 
Glu Met Arg Arg Gin Phe Glu Asp Leu Val Lys Asp He Thr Leu Asn 



300 305 310 



aaa gaa gag aaa aaa gaa aac age ttt gaa atg caa aga ggt gat gag 1012 
Lys Glu Glu Lys Lys Glu Asn Ser Phe Glu Met Gin Arg Gly Asp Glu 
315 320 325 

gat cct caa att gca gca cac gtt gta age gaa gee aac agt aat gca 1060 
Asp Pro Gin He Ala Ala His Val Val Ser Glu Ala Asn Ser Asn Ala 
330 335 340 

gca tec gtt eta cag tgg gee aag aaa gga tat tat acc atg aaa age 1 108 
Ala Ser Val Leu Gin Trp Ala Lys Lys Gly Tyr Tyr Thr Met Lys Ser 
345 350 355 

aac ttg gta atg ctt gaa aat ggg aaa cag ctg acg gtt aaa aga gaa 1156 
Asn Leu Val Met Leu Glu Asn Gly Lys Gin Leu Thr Val Lys Arg Glu 
360 365 370 375 

gga etc tat tat gtc tac act caa gtc acc ttc tgc tct aat egg gag 1204 
Gly Leu Tyr Tyr Val Tyr Thr Gin Val Thr Phe Cys Ser Asn Arg Glu 
380 385 390 

cct teg agt caa cgc cca ttc ate gtc ggc etc tgg ctg aag ccc age 1252 
Pro Ser Ser Gki Arg Pro Phe He Val Gly Leu Trp Leu Lys Pro Ser 
395 400 405 

att gga tct gag aga ate tta etc aag gcg gca aat ace cae agt tec 1300 
He Gly Ser Glu Arg He Leu Leu Lys Ala Ala Asn Thr His Ser Ser 
410 415 420 

tec cag ctt tgc gag cag cag tct gtt cae ttg ggc gga gtg ttt gaa 1348 
Ser Ghi Leu Cys Glu Qhx Ghi Ser Val His Leu Gly Gly Val Phe Glu 
425 430 435 

tta caa get ggt get tct gtg ttt gtc aac gtg act gaa gca age caa 1396 
Leu Gto Ala Gly Ala Ser Val Phe Val Asn Val Thr Glu Ala Ser Ghi 
440 445 450 455 

gtg ate cac aga gtt ggc ttc tea tct ttt ggc tta etc aaa etc 1441 
Val He His Arg Val Gly Phe Ser Ser Phe Gly Leu Leu Lys Leu 
460 465 470 

tgaacagtgc gctgtcctag gctgcagcag ggtacc 1477 

<2i0>6 

<211>470 

<212>PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (except CRD) fused to the 

extracellular domain of murine CD40 ligand 



<400> 6 



Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
1 5 10 15 



Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 
85 90 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly He 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn He 
115 120 125 

Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gk Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro . 
165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gbi Gly 
180 185 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly He Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gbi Met Glu Ala Leu Lys Gly Lys Leu Gki Arg Leu 
225 230 235 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 
245 250 255 

His Arg Arg Leu Asp Lys Val Glu Glu Glu Val Asn Leu His Glu Asp 
260 265 270 

Phe Val Phe He Lys Lys Leu Lys Arg Cys Asn Lys Gly Glu Gly Ser 
275 280 285 



Leu Ser Leu Leu Asn Cys Glu Glu Met Arg Arg Gin Phe Glu Asp Leu 
290 295 300 

Val Lys Asp He Thr Leu Asn Lys Glu Glu Lys Lys Glu Asn Ser Phe 
305 310 315 320 

Glu Met Gin Arg Gly Asp Glu Asp Pro Gin He Ala Ala His Val Val 
325 330 335 

Ser Glu Ala Asn Ser Asn Ala Ala Ser Val Leu Gin Trp Ala Lys Lys 
340 345 350 

Gly Tyr Tyr Thr Met Lys Ser Asn Leu Val Met Leu Glu Asn Gly Lys 
355 360 365 

Gin Leu Thr Val Lys Arg Glu Gly Leu Tyr Tyr Val Tyr Thr Gin Val 
370 375 380 

Thr Phe Cys Ser Asn Arg Glu Pro Ser Ser Gb Arg Pro Phe He Val 
385 390 395 400 

Gly Leu Trp Leu Lys Pro Ser He Gly Ser Glu Arg He Leu Leu Lys 
405 410 415 

Ala Ala Asn Thr His Ser Ser Ser Ghi Leu Cys Glu Ghi Ghi Ser Val 
420 425 430 

His Leu Gly Gly Val Phe Glu Leu Ghi Ala Gly Ala Ser Val Phe Val 
435 440 445 

Asn Val Thr Glu Ala Ser Ghi Val He His Arg Val Gly Phe Ser Ser 
450 455 460 



Phe Gly Leu Leu Lys Leu 
465 470 



